Recovered Memories: Bringing the Air Force Archive Into the Digital Age

dc.contributor.advisorKOHL, Deborah
dc.contributor.advisorBLODGETT, Bridget
dc.contributor.advisorWOLF, Richard
dc.contributor.authorBYRD, David Allan
dc.contributor.departmentUniversity of Baltimore. Yale Gordon College of Arts and Sciencesen_US
dc.contributor.programUniversity of Baltimore. Doctor of Science in Information and Interaction Designen_US
dc.date.accessioned2023-04-17T19:47:59Z
dc.date.available2023-04-17T19:47:59Z
dc.date.issued2023-05-31
dc.descriptionWithout 100 percent accurate content, usability and information architecture mean little. Pristine content is the sine qua non of the user experience.en_US
dc.descriptionD.S. -- The University of Baltimore, 2023
dc.descriptionDissertation submitted to the Yale Gordon College of Arts and Sciences of The University of Baltimore in partial fulfillment of the requirements for the degree of Doctor of Science in Information and Interaction Design
dc.description.abstractRecruiting unpaid volunteers through a “crowdsourcing” technique has become a near-ubiquitous tactic of libraries, archives, and other institution seeking to textually digitize their analog holdings. Determining 1) certain demographic characteristics of those volunteers, 2) their familiarity with the topic, 3) their motivation, and 4) the process they use that correlate with higher performance in that task, has been little studied. Recovered Memories investigates 9 such variables both individually and combined. Optical character recognition (OCR) technology is one automated method for converting text, but has proven to be unsatisfactory for creating web content or e-books, mining data, creating data for artificial intelligence (AI) and machine learning (ML) software, and even some search functions. This paper theorized that some of the variables studied will correlate with higher performance. This research project examined the efficacy of a custom-built application to gather data (www.airforcehistory.net) One hundred and twelve historic documents from the Air Force Historical Research Agency’s archive were used in the examination to measure participants’ performance. Despite a relatively small sample size (n=50) and the lack of control endemic to field research, the participant variables of ‘familiarity with U.S. history in Vietnam (PV5),’ ‘process choice (PV8),’ and ‘age (PV2), group affiliation (PV7), and ‘familiarity with U.S. Air Force operations (PV6)’ were related. Multiple regression showed three factors correlated with better performance: gender, familiarity/Vietnam, and process selection. The author argues that OCR correction rather than copying/transcription, i.e. process choice, results in best performance and might be generalizable. Given the interest at the federal level in textually digitizing the holdings of military archives, this study has strong implications for policy and practice.en_US
dc.format.extent220 leavesen_US
dc.format.mimetypeapplication/pdf
dc.genredissertationsen_US
dc.identifierdoi:10.13016/m23g2p-5vqr
dc.identifier.otherUB_2023_Byrd_D
dc.identifier.urihttp://hdl.handle.net/11603/27621
dc.language.isoen_USen_US
dc.rightsAttribution-NonCommercial-NoDerivs 3.0 United States*
dc.rightsThis item may be protected under Title 17 of the U.S. Copyright Law. It is made available by The University of Baltimore for non-commercial research and educational purposes.
dc.rights.urihttp://creativecommons.org/licenses/by-nc-nd/3.0/us/*
dc.subjectcrowdsourcingen_US
dc.subjectocren_US
dc.subjectArchivesen_US
dc.subjectdigital historyen_US
dc.subjectoptical character recognitionen_US
dc.subjectanalog to digital conversionen_US
dc.titleRecovered Memories: Bringing the Air Force Archive Into the Digital Ageen_US
dc.title.alternativeAttributes Correlated with Improved Performance in Textually Digitizing Analog Documentsen_US
dc.typeTexten_US

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
David Byrd Dissertation 4-2023 SIGNED.pdf
Size:
3.85 MB
Format:
Adobe Portable Document Format
Description:
Dissertation text
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.71 KB
Format:
Item-specific license agreed upon to submission
Description: