White Paper: AI Perceptions at the University of Baltimore


This white paper, produced by the UBalt AI team, explores the perceptions of Artificial Intelligence (AI) and generative AI within the UBalt community. It aims to uncover how students, faculty, and staff view AI’s role and implications in the educational landscape. The university collaborated with Ithaka S+R to acquire established, reliable and valid surveys from the AI literature, which was then adapted by the UBalt AI team to meet the needs of our academic community. This survey included a blend of both quantitative and qualitative questions, ensuring a deep understanding of the respondents’ views. . . . The responses obtained were then analyzed using descriptive and inferential statistics, as well as an exploratory qualitative analysis to extract meaningful insights, setting the stage for informed discussions and decision-making around AI in education.

http://tinyurl.com/mr47zx3j

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Deepfake Scammer Walks off with $25 Million in First-of-Its-Kind AI Heist"


The scam featured a digitally recreated version of the company’s chief financial officer, along with other employees, who appeared in a video conference call instructing an employee to transfer funds.

http://tinyurl.com/9aspy8u7

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Could AI Change the Scientific Publishing Market Once and for All?"


Artificial-intelligence tools in research like ChatGPT are playing an increasingly transformative role in revolutionizing scientific publishing and re-shaping its economic background. They can help academics to tackle such issues as limited space in academic journals, accessibility of knowledge, delayed dissemination, or the exponential growth of academic output. Moreover, AI tools could potentially change scientific communication and academic publishing market as we know them. They can help to promote Open Access (OA) in the form of preprints, dethrone the entrenched journals and publishers, as well as introduce novel approaches to the assessment of research output. It is also imperative that they should do just that, once and for all.

https://arxiv.org/abs/2401.14952

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"On the Readiness of Scientific Data for a Fair and Transparent Use in Machine Learning"


To ensure the fairness and trustworthiness of machine learning (ML) systems, recent legislative initiatives and relevant research in the ML community have pointed out the need to document the data used to train ML models. Besides, data-sharing practices in many scientific domains have evolved in recent years for reproducibility purposes. In this sense, the adoption of these practices by academic institutions has encouraged researchers to publish their data and technical documentation in peer-reviewed publications such as data papers. In this study, we analyze how this scientific data documentation meets the needs of the ML community and regulatory bodies for its use in ML technologies. We examine a sample of 4041 data papers of different domains, assessing their completeness and coverage of the requested dimensions, and trends in recent years, putting special emphasis on the most and least documented dimensions. As a result, we propose a set of recommendation guidelines for data creators and scientific data publishers to increase their data’s preparedness for its transparent and fairer use in ML technologies.

https://arxiv.org/abs/2401.10304

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"ChatGPT in Medical Libraries, Possibilities and Future Directions: An Integrative Review"


Positioned as a review, our study elucidates the applications of ChatGPT in medical libraries and discusses relevant considerations. The integration of ChatGPT into medical library services holds promise for enhancing information retrieval and user experience, benefiting library users and the broader medical community.

https://doi.org/10.1111/hir.12518

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Launch of Scopus AI to Help Researchers Navigate the World of Research "


Scopus AI is based on Scopus’ trusted content from over 27,000 academic journals, from more than 7,000 publishers worldwide, with over 1.8 billion citations, and includes over 17 million author profiles. Scopus content is vetted by an independent board of world-renowned scientists and librarians who represent the major scientific disciplines.

Since the alpha launch in August 2023, thousands of researchers across the world have tested Scopus AI. Their feedback has reinforced that, as generative AI evolves, researchers want trustworthy, cited research that is relevant and highly personalized to their needs.

Feedback from the research community has led to Scopus AI offering the following powerful features:

  • Expanded and Enhanced Summaries that provide researchers with fast overviews of key topics that they can dig deeper into, sometimes even highlighting gaps in literature. . . .
  • Foundational and Influential Papers that enable researchers to rapidly pinpoint seminal works, navigating academic progress and impact with precision and ease.
  • Academic Expert Search identifies leading experts in their fields and provides explanations of their expertise relevant to the user’s query, helping save time.
  • Enhanced breadth of research, covering ten years of Scopus content to support well-rounded perspective on topics of interest, and improved design to enhance the user experience.

http://tinyurl.com/22f78hv6

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Paywall: "Can ChatGPT Identify Predatory Biomedical and Dental Journals? A Cross-Sectional Content Analysis"


ChatGPT may effectively distinguish between predatory and legitimate journals, with accuracy rates of 92.5% and 71%, respectively. The potential utility of large-scale language models in exposing predatory publications is worthy of further consideration.

https://doi.org/10.1016/j.jdent.2024.104840

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Very Scary’: Mark Zuckerberg’s Pledge to Build Advanced AI Alarms Experts"


The Meta chief executive has said the company will attempt to build an artificial general intelligence (AGI) system and make it open source, meaning it will be accessible to developers outside the company. The system should be made "as widely available as we responsibly can," he added.

AGI? "An artificial general intelligence (AGI) is a hypothetical type of intelligent agent. If realized, an AGI could learn to accomplish any intellectual task that human beings or animals can perform. Alternatively, AGI has been defined as an autonomous system that surpasses human capabilities in the majority of economically valuable tasks."

http://tinyurl.com/2apt6kh6

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"How Can Universities Create AI Tools for their Communities? An Interview with the Creators of UC San Diego’s TritonGPT"


Since then, the University of California San Diego has launched TritonGPT, currently available in beta by invitation only. TritonGPT, a language model with a ChatGPT-like interface, was trained to answer detailed questions about UC San Diego’s policies, procedures, and campus life. Though TritonGPT is designed to serve a purpose very similar to the platforms at Harvard, Michigan, and Knoxville, UC San Diego’s approach stands out because instead of relying on Microsoft’s Azure OpenAI service, TritonGPT is hosted on local infrastructure and optimized for use in administrative operations.

https://tinyurl.com/5t3bwedm

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Google Launches Gemini, the AI model It Hopes Will Take Down GPT-4"


Gemini is more than a single AI model. There’s a lighter version called Gemini Nano that is meant to be run natively and offline on Android devices. There’s a beefier version called Gemini Pro that will soon power lots of Google AI services and is the backbone of Bard starting today. And there’s an even more capable model called Gemini Ultra that is the most powerful LLM Google has yet created and seems to be mostly designed for data centers and enterprise applications.

https://tinyurl.com/5n8jp9n2

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"If Creators Suing AI Companies Over Copyright Win, It Will Further Entrench Big Tech"


The almost certain outcome [of copyright suits against AI companies](because it’s what happens every other time a similar situation arises) is that there will be one (possibly two) giant entities who will be designated as the "collection society" with whom AI companies will have to negotiate or to just purchase a "training license" and that entity will then collect a ton of money, much of which will go towards "administration," and actual artists will… get a tiny bit.. . .

But, given the enormity of the amount of content, and the structure of this kind of thing, the cost will be extremely high for the AI companies (a few pennies for every creator online can add up in aggregate), meaning that only the biggest of big tech will be able to afford it.

https://tinyurl.com/y4azzsdt

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

STM: "New White Paper Launch: Generative AI in Scholarly Communications"


The paper looks at the ethical, legal, and practical aspects of GenAI, highlighting its potential to transform scholarly communications, and covers a range of topics from intellectual property rights to the challenges of maintaining integrity in the digital age. The paper provides best-practice principles and recommendations for authors, editorial teams, reviewers, and vendors, ensuring a responsible and ethical approach to the use of GenAI tools.

https://tinyurl.com/4m6m8n9j

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Beyond the Hype Cycle: Experiments with ChatGPT’s Advanced Data Analysis at the Palo Alto City Library"


In June and July of 2023 the Palo Alto City Library’s Digital Services team embarked on an exploratory journey applying Large Language Models (LLMs) to library projects. This article, complete with chat transcripts and code samples, highlights the challenges, successes, and unexpected outcomes encountered while integrating ChatGPT Pro into our day-to-day work.

Our experiments utilized ChatGPTs Advanced Data Analysis feature (formerly Code Interpreter). The first goal tested the Search Engine Optimization (SEO) potential of ChatGPT plugins. The second goal of this experiment aimed to enhance our web user experience by revising our BiblioCommons taxonomy to better match customer interests and make the upcoming Personalized Promotions feature more relevant. ChatGPT helped us perform what would otherwise be a time-consuming analysis of customer catalog usage to determine a list of taxonomy terms better aligned with that usage.

In the end, both experiments proved the utility of LLMs in the workplace and the potential for enhancing our librarian’s skills and efficiency. The thrill of this experiment was in ChatGPT’s unprecedented efficiency, adaptability, and capacity. We found it can solve a wide range of library problems and speed up project deliverables. The shortcomings of LLMs, however, were equally palpable. Each day of the experiment we grappled with the nuances of prompt engineering, contextual understanding, and occasional miscommunications with our new AI assistant. In short, a new class of skills for information professionals came into focus.

https://journal.code4lib.org/articles/17867

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Paywall — NYT: "Ego, Fear and Money: How the A.I. Fuse Was Lit"


The people who were most afraid of the risks of artificial intelligence decided they should be the ones to build it. Then distrust fueled a spiraling competition. . . .

Over dinner, Mr. Gates told them he doubted that large language models could work. He would stay skeptical, he said, until the technology performed a task that required critical thinking & passing an A.P. biology test, for instance. . . .

[Five months later] Mr. Brockman gave the system [GPT-4] a multiple-choice advanced biology test, and Ms. Voss graded the answers. . . .

There were 60 questions. GPT-4 got only one answer wrong.

Mr. Gates sat up in his chair, his eyes opened wide. In 1980, he had a similar reaction when researchers showed him the graphical user interface that became the basis for the modern personal computer. He thought GPT was that revolutionary.

https://tinyurl.com/mvjs3z3k

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

IFLA AI SIG: Developing a Library Strategic Response to Artificial Intelligence


The strategy most aligned to existing library practices and librarian identities, particularly in university, school and public libraries, is to take a lead role in promoting AI literacy. There is a widespread understanding that the public, as citizens and workers need to understand the new technologies. Students, whatever discipline they are studying, need such knowledge for employability. . . .

AI literacy is likely to include the ability to identify when AI is being used; to appreciate the differences between narrow and general AI; to understand what types of problem AI is good at solving; to understand how machine learning models are trained. It would also include awareness of ethical issues such as bias, privacy, explainability and social impact.

https://tinyurl.com/s6r6czrh

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works | | Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"AI for Academia: Digital Science Acquires Writefull to Empower Researchers and Publishers"


Writefull’s AI language models are trained on billions of sentences taken from millions of journal articles. Matched with a firm commitment to data privacy, this means its models offer unparalleled assistance to users in academic writing, paraphrasing, copy editing and revisions. . . .

Writefull’s language services are now used by students and researchers at more than 1,500 institutions, and are integrated into the workflows of top publishers and copy editors, such as at the American Chemical Society (ACS), Hindawi, the British Ecological Society, Sage, and the Royal Society of Chemistry (RSC). Writefull’s APIs are also integrated with Digital Science’s collaborative LaTeX editor Overleaf.

https://tinyurl.com/ywyap23p

| Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Introducing the LC Labs Artificial Intelligence Planning Framework"


To account for these [AI] challenges and realities, LC Labs has been developing a planning framework to support the responsible exploration and potential adoption of AI at the Library. At a high level, the framework includes three planning phases: 1) Understand 2) Experiment and 3) Implement, each supports the evaluation of three elements of ML: 1) Data; 2) Models; and 3) People. We’ve developed a set of worksheets, questionnaires, and workshops to engage stakeholders and staff and identify priorities for future AI enhancements and services. The mechanisms, tools, collaborations, and artifacts together form the AI Planning Framework. Our hope in sharing the framework and associated tools in this initial version is to encourage others to try it out and to solicit additional feedback. We will continue updating and refining the framework as we learn more about the elements and phases of ML planning.

https://tinyurl.com/4wuakvjy

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works | | Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Approaching Artificial Intelligence and Open Research in Sync: Opportunities and Challenges"


  • AI can generate more complete and disambiguated metadata to enhance discovery and move search from the traditional keyword-based model to semantic and conversation-based searches.
  • AI can also help publishers improve accessibility, to make content available to a broader audience.
  • AI as a reader and consumer will become as important a consideration as the human reader and consumer. Publications should consider machines as consumer and provide machine readable and consumable formats.
  • AI can create personalized recommendations and news feeds, simultaneously helping researchers find the answers they need and allowing publishers to target specific audiences for specific publications.
  • Even better, AI can perform reverse engineering to measure the contribution of each source to the final answers. And publishers can charge based on the contribution. This could be new business model in the future. Many AI researchers are currently working on enabling explainable and transparent AI, but this research will take time.

https://tinyurl.com/uu4dhs9y

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works | | Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Judge Will Toss Part of Authors’ AI Copyright Lawsuit "


According to Reuters, judge Vince Chhabria said the authors’ allegations that text generated by Llama infringes their copyrights simply doesn’t stand up to scrutiny. "When I make a query of Llama, I’m not asking for a copy of Sarah Silverman’s book—I’m not even asking for an excerpt," Chhabria observed, noting that, under the authors’ theory, a side-by-side comparison of text generated by the AI application and Silverman’s book would have to show they are similar.

However, the judge said he will not dismiss the case with prejudice, meaning the authors will be allowed to amend and refile their claims.

https://tinyurl.com/sd4wbba4

| Research Data Publication and Citation Bibliography | Research Data Sharing and Reuse Bibliography | Research Data Curation and Management Bibliography | Digital Scholarship |

"Google SGE [Search Generative Experience]: A New Way to Search, Teach, and Resist"


Google SGE removes many of the barriers that make us doubt our search abilities. We already know that users rarely look past the first page of results or scroll past the fold of a webpage, but with SGE you get exactly what you think is "good enough." However, the more I searched the more disappointed I was that Google continued to serve up the same kinds of sources you usually find at the top of the algorithm, such as Wikipedia pages, blog posts, news, and popular media. The only disclaimer that SGE gives is "Info quality may vary."

https://tinyurl.com/4tntbsbh

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"How Well Does ChatGPT Handle Reference Inquiries? An Analysis Based on Question Types and Question Complexities"


To explore whether artificial intelligence can be used to enhance library services, this study used ChatGPT to answer reference questions. . . Overall ChatGPT’s performance was fair, but it did poorly in information accuracy. It scored the highest when handling facilities and equipment-related questions but the lowest when dealing with e-resources access problems. ChatGPT was weak in answering advanced research questions, complex inquiries, and known item searches relating to a specific local environment, but it could be adopted to enhance library communication with users.

https://tinyurl.com/3dabv5f8

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

Paywall: Generative AI and Librarians — "The Prompt Engineering Librarian"


In terms of training the public in prompt engineering skills, no single discipline or profession currently takes the lead, presenting an opportunity for professions like librarianship to step into this role. Librarians are already well-equipped to educate the public in a wide range of literacy skills and tasks, so prompt engineering may be a natural progression. The purpose of this paper is to examine the potential role of prompt engineering for library professionals.

https://doi.org/10.1108/LHTN-10-2023-0189

Also see: "Prompt Engineers or Librarians? An Exploration."

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works | | Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"Developing Responsible AI practices at the Smithsonian Institution"


Applications of artificial intelligence (AI) and machine learning (ML) have become pervasive in our everyday lives. These applications range from the mundane (asking ChatGPT to write a thank you note) to high-end science (predicting future weather patterns in the face of climate change), but, because they rely on human-generated or mediated data, they also have the potential to perpetuate systemic oppression and racism. For museums and other cultural heritage institutions, there is great interest in automating the kinds of applications at which AI and ML can excel, for example, tasks in computer vision including image segmentation, object recognition (labelling or identifying objects in an image) and natural language processing (e.g. named-entity recognition, topic modelling, generation of word and sentence embeddings) in order to make digital collections and archives discoverable, searchable and appropriately tagged.

A coalition of staff, Fellows and interns working in digital spaces at the Smithsonian Institution, who are either engaged with research using AI or ML tools or working closely with digital data in other ways, came together to discuss the promise and potential perils of applying AI and ML at scale and this work results from those conversations. Here, we present the process that has led to the development of an AI Values Statement and an implementation plan, including the release of datasets with accompanying documentation to enable these data to be used with improved context and reproducibility (dataset cards). We plan to continue releasing dataset cards and for AI and ML applications, model cards, in order to enable informed usage of Smithsonian data and research products.

https://doi.org/10.3897/rio.9.e113334

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works |
| Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |

"NEH Announces New Research Initiative: Humanities Perspectives on Artificial Intelligence"


NEH’s Humanities Perspectives on Artificial Intelligence initiative will support numerous AI-related humanities projects through the following funding opportunities:

https://tinyurl.com/c5sb7x26

| Artificial Intelligence and Libraries Bibliography |
Research Data Curation and Management Works | | Digital Curation and Digital Preservation Works |
| Open Access Works |
| Digital Scholarship |