Multiple search #172

Open
ksmith2002 wants to merge 6 commits into main from multiple_search

@ksmith2002
Contributor

Adds the ability to search for multiple institutions

@chinardankhara
Contributor

@ksmith2002 @lewlefton

Following up from the Thursday meeting, I have a couple of ideas:

  1. If the intended use for multiple institutions is <=5 items, we can change the institutions search bar to multi-select.
  2. If we want to support an arbitrary number of selections, we can remove the checkboxes and support a CSV upload of institution names, which we can sanitize and fuzzy-search to return the right institutions.
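
A minimal sketch of the sanitize-then-fuzzy-search idea in option 2, using only the Python standard library. The function names, the sanitizing rules, and the 0.8 cutoff are illustrative assumptions, not code from this PR; the known-name list would come from wherever the app keeps its institution names.

```python
import difflib
import unicodedata


def sanitize(name: str) -> str:
    """Lowercase, strip accents, and collapse whitespace (illustrative rules only)."""
    decomposed = unicodedata.normalize("NFKD", name)
    no_accents = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    return " ".join(no_accents.lower().split())


def fuzzy_match(query: str, known_names: list[str], cutoff: float = 0.8) -> list[str]:
    """Return up to three known names whose sanitized form is close to the query."""
    # Map each sanitized name back to its original spelling.
    lookup = {sanitize(n): n for n in known_names}
    hits = difflib.get_close_matches(sanitize(query), list(lookup), n=3, cutoff=cutoff)
    return [lookup[h] for h in hits]


# Hypothetical known names, standing in for the app's institution list.
known = [
    "Georgia Institute of Technology",
    "University of Nebraska–Lincoln",
    "Cañada College",
]
print(fuzzy_match("canada college", known))
```

An empty result, or more than one hit, is the point at which a user would have to confirm the match by hand.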

Happy to pick this PR up further.

@lewlefton
Contributor

The options @chinardankhara mentions are both useful.

Not sure how multiselect plays with autocomplete, but if it filters on the fly, I don't see why we need to stop at <=5 items?

We do want to support an arbitrary number of selections. Removing the checkboxes is fine as long as we have some alternative. CSV upload could initially just be OpenAlex IDs (or ROR IDs, or URLs with a mix of OpenAlex and ROR API calls). That should be simplest to implement (less work on our end, and it shows proof of concept). Having users input arbitrary institution names brings up a lot of other UI/UX issues. Even if we sanitize and fuzzy-search for a better initial list, there will be subtle errors, and the need for user validation creates extra steps and potentially unresolvable ambiguities that will require users to input PIDs like OpenAlex or ROR IDs anyway.

I think for a first pass, requiring a single-column CSV with OpenAlex and/or ROR URIs means we can at least provide results cleanly, even if the user has to work harder pre-upload. That will allow us to get the visualization and data components in place, and then we can work on simplifying user input.
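
As a rough illustration of the single-column CSV idea, the sketch below sorts each cell into an OpenAlex institution ID, a ROR ID, or a rejected value. The function name, the return shape, and the loose ROR pattern (a leading `0`, six alphanumerics, two digits) are assumptions for illustration and not the PR's implementation; the ROR value in the sample is made up to fit the pattern.

```python
import csv
import io
import re

# Accept bare IDs or full URLs; both patterns are deliberately loose.
OPENALEX_RE = re.compile(r"^(?:https?://openalex\.org/)?(I\d+)$", re.IGNORECASE)
ROR_RE = re.compile(r"^(?:https?://ror\.org/)?(0[0-9a-z]{6}[0-9]{2})$", re.IGNORECASE)


def parse_identifier_csv(text: str):
    """Return (valid, rejected): valid is a de-duplicated list of (kind, id) pairs."""
    valid, rejected, seen = [], [], set()
    for row in csv.reader(io.StringIO(text)):
        if not row or not row[0].strip():
            continue  # skip blank lines
        cell = row[0].strip()
        if m := OPENALEX_RE.match(cell):
            item = ("openalex", m.group(1).upper())
        elif m := ROR_RE.match(cell):
            item = ("ror", m.group(1).lower())
        else:
            rejected.append(cell)  # e.g. a free-text name or a header row
            continue
        if item not in seen:
            seen.add(item)
            valid.append(item)
    return valid, rejected


sample = "I130701444\nhttps://openalex.org/I85230861\nhttps://ror.org/012345678\nGeorgia Tech\n"
valid, rejected = parse_identifier_csv(sample)
print(valid)
print(rejected)
```

Returning the rejected cells separately lets the UI tell the user which rows need a persistent identifier instead of failing the whole upload.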

Happy to chat about the pros and cons of these and other approaches further. This is a design decision that likely won't be the final one. Another consideration is that uploading a CSV of people should work similarly to uploading organizations, and a list of OpenAlex IDs eliminates the harder problem of name disambiguation.

@ksmith2002
Contributor Author

I've added an update that allows you to upload a .csv file with OpenAlex IDs such as these:
I130701444
I85230861
I150468666

Right now, for the upload to work the other fields must be empty, because the search defaults to them. I've also left the checkbox to add another institution in place.

As you mentioned, this can be updated in the future, but this is a first update to get CSV upload working.

@gladishd
Contributor

gladishd commented May 25, 2025

God this thing is whack!
Screenshot 2025-05-24 at 11 00 26 PM
I can only assume Cañada College means Cañada College..
Screenshot 2025-05-24 at 11 04 23 PM
Screenshot 2025-05-24 at 11 05 18 PM
Screenshot 2025-05-24 at 11 03 38 PM
Update: Today I tried to reproduce the error (the one where I get a long list saying "University of California" over and over again) and could not, so perhaps that was a far-fetched glitch. In the meantime, I should compile a list of formatted name changes for the database. UTF-8 aside. institutions.csv.
Cañada College -> Cañada College
Hannibal–LaGrange University -> Hannibal–LaGrange University
Brigham Young University–Hawaii -> Brigham Young University–Hawaii
University of Nebraska–Lincoln -> University of Nebraska–Lincoln
Baldwin–Wallace College -> Baldwin–Wallace College
University of HawaiÊ»i at MÄ�noa -> University of Hawaiʻi at Mānoa
University of Arkansas – Fort Smith -> University of Arkansas – Fort Smith
Texas A&M University – San Antonio -> Texas A&M University – San Antonio
Indiana University – Purdue University Fort Wayne -> Indiana University – Purdue University Fort Wayne
University of Puerto Rico at Río Piedras -> University of Puerto Rico at Río Piedras
Lenoir–Rhyne University -> Lenoir–Rhyne University
Texas A&M University – Kingsville -> Texas A&M University – Kingsville
Randolph–Macon College -> Randolph–Macon College
Freed–Hardeman University -> Freed–Hardeman University
Universidad José María Vargas -> Universidad José María Vargas
Pünkösdi Teológiai FÅ‘iskola -> Pünkösdi Teológiai Főiskola
IḷisaÄ¡vik College -> Iḷisaġvik College
Baptist University of the Américas -> Baptist University of the Américas
Institut des Hautes Études de Management -> Institut des Hautes Études de Management
Médecins Sans Frontières -> Médecins Sans Frontières
Texas A&M University – Commerce -> Texas A&M University – Commerce
University of Mary Hardin–Baylor -> University of Mary Hardin–Baylor
Hampden–Sydney College -> Hampden–Sydney College
LeMoyne–Owen College -> LeMoyne–Owen College
Gardner–Webb University -> Gardner–Webb University
Colby–Sawyer College -> Colby–Sawyer College
Chandler–Gilbert Community College -> Chandler–Gilbert Community College
Kenrick–Glennon Seminary -> Kenrick–Glennon Seminary
Hardin–Simmons University -> Hardin–Simmons University
Orangeburg–Calhoun Technical College -> Orangeburg–Calhoun Technical College
Pasco–Hernando State College -> Pasco–Hernando State College
Florence–Darlington Technical College -> Florence–Darlington Technical College
Embry–Riddle Aeronautical University -> Embry–Riddle Aeronautical University (Note: The file contains "Embry-Riddle Aeronautical University Prescott Arizona Campus" which uses a standard hyphen, and "Embry–Riddle Aeronautical University" which already uses an en dash. If "Embry–Riddle Aeronautical University" with the mojibake – was a separate entry you were concerned about, the correction to an en-dash – is appropriate.)
Atlanta’s John Marshall Law School -> Atlanta’s John Marshall Law School
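
Corrections like the ones listed above can often be made programmatically. The sketch below assumes the garbling came from UTF-8 bytes being decoded as Windows-1252, which is the usual cause of this pattern but has not been confirmed for institutions.csv; `fix_mojibake` is a hypothetical helper name.

```python
def fix_mojibake(text: str) -> str:
    """Undo UTF-8 text that was mis-decoded as Windows-1252, when that is possible."""
    try:
        return text.encode("cp1252").decode("utf-8")
    except UnicodeError:
        # Not reversible as a whole string: the text is already clean,
        # only partly garbled, or a byte was lost along the way.
        return text


garbled = "Ca\u00c3\u00b1ada College"  # displays as "CaÃ±ada College"
print(fix_mojibake(garbled))           # Cañada College
print(fix_mojibake("Cañada College"))  # already clean, returned unchanged
```

Names where a byte was replaced during the bad decode (the Mānoa entry shows a replacement character) or where only part of the string is garbled cannot be recovered by this round trip, so they would still need a manual mapping or a dedicated library such as ftfy.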

4 participants