How AI Rewrites the Archive—For Better or Worse

Jon Ippolito
Professor of New Media
Director, Digital Curation graduate program
The University of Maine
DigitalCuration.UMaine.edu

🐦/🧡 @jonippolito 🐘 @jonippolito@digipres.club πŸ¦‹ @jonippolito.bsky.social πŸ”— https://www.linkedin.com/in/jonippolito
Wired Headline Ai Coming For Lawyers
Wired Headline Ai Coming For Lawyers

Will Generative AI rescue or replace culture?

ai_reconstruction_rembrandt_night_watch_annotated.png

Missing edges of Rembrandt painting The Night Watch restored by the Rijksmuseum using AI.

library_of_virginia_ai_black_ancestor.png

AI images generated by the Library of Virginia using the description of John Butler from the Fairfax County Register of Free Negroes, 1822-1841.

IMPACT RISK framework for understanding AI risks

ai_impact_risk_acronym

https://ai-impact-risk.com/

Improved access to collections

imgs_dot_ai_collection_similar_to_night_watch

Imgs.ai

Image generation in place of search?

leonardo_ai_realtime_generator_horse

Leonardo.ai Realtime Generation

Replacing the entire Internet?

websim_ai_malaysia

Websim.ai

Interpreting data

Demo analyzing Digital Curation student backgrounds

Can ChatGPT analyze qualitative rather than just quantitative results?

Chat transcript

πŸ‘‰πŸŒŸ Try it!

Beginner exercise

Advanced exercise

Design a per se law school exercise for Courtroom 5 (by April Dawson)

Free chatbots

How generative AI works

learnwithai_average_trail
learnwithai_average_trail

β€œI'm trying to plan a 22 mile trail route in a city in my state. I asked ChatGPT for route suggestions a few times and each time it just made stuff up--once it made up a trail that doesn't exist (to connect two other trails that do) and other times to say to go from one trail to another when the trails are miles apart”

β€”Katie, New Mexico, March 2023

generative_ai_0_text_background_1080
generative_ai_0_text_background_1080
generative_ai_1_text_original_1080
generative_ai_1_text_original_1080
generative_ai_2_text_trained_1080
generative_ai_2_text_trained_1080
generative_ai_3_image_original1080
generative_ai_3_image_original1080

Associating images with words

Uploading images
Alt Tag In Wordpress Star Wars
Viewing images
Alt Tag In Html Star Wars
generative_ai_4_image_trained_1080
generative_ai_4_image_trained_1080
generative_ai_5_latent_average_1080
generative_ai_5_latent_average_1080
generative_ai_6_latent_average_1080
generative_ai_6_latent_average_1080

Practical applications for archives

Sara and Ben Brumfield Sara and Ben Brumfield
Sara and Ben Brumfield, FromThePage
Sara and Ben Brumfield
Sara and Ben Brumfield
Computer control
Demo by Matt Wolfe

Successes

Tags

Metadata as averages

Fuzzy matches

Transcribing messy handwritten documents.

Data mashups

Find place names in text and geolocate them on a map.

Summaries

NotebookLM podcast generated from MAM schedule

Problems

Specious output

ChatGPT's handwriting recognition is dangerous because plausible even when incorrect.

Overconfidence

FromThePage checkbox anecdote

So how should be approach AI?

High v. low stakes as determinant

High v low stakes (as represented by poker chips)

The problem with high v. low stakes

x-ray AI-generated slack comment

Low stakes

Google Olympics Ai Fan Letter Ad

Opportunistic v. prescriptive tasks

cracking_a_safe
cracking_a_safe
cutting_bomb_wire
cutting_bomb_wire

Opportunistic v. prescriptive tasks

medieval_soldiers_breaching_fortress_alamy medieval_castle_aerial_drawing_carneycastle

Opportunistic v. prescriptive tasks

AI 🀝 crowdsourcing

crowdsourced_transcription_apps_xvga

Crowdsource human volunteers to check the results of AI-generated solutions.

Back to title screen