The UCSF Industry Documents Library is pleased to highlight the work of 2024 Summer Fellow Gordon Lichtstein. Gordon is an incoming MIT student with an interest in the intersection of computer science and linguistics in NLP and the application of NLP for the betterment of humanity such as in environmental sustainability or the digital humanities.
Over the course of the 8-week internship, Gordon crafted and completed four distinct projects that leverage natural language processing and data science within the context of our JUUL Labs Collection and the broader IDL. Project One investigates the optical character recognition (OCR) accuracy of low-quality and handwritten documents in the absence of ground truth data. Project Two explores the implementation of embedding search algorithms and visualizations aimed at enhancing the relevance of document recommendations for users. Project Three employs txt-ferret to conduct a thorough scan of a substantial corpus of industry documents to identify sensitive information, including credit card numbers. Finally, Project Four assesses the biases present in large language model (LLM) summarization through the lens of sentiment analysis.
Read Gordon's entire report and reflection via eScholarship.
The IDL staff is deeply appreciative of Gordon's thoughtful and comprehensive contributions, as well as his engagement in team meetings and Amazon Web Services workshops. His projects and use of NLP techniques with our document corpus have greatly enriched our understanding.
The UCSF-JHU Opioid Industry Documents Archive (OIDA) added more than 34,000 documents to the Insys Litigation Documents collection. This documents drop contains emails and spreadsheets from 2014 discussing the many aspects of Insys's business activities, ranging from sales tracking spreadsheets to speakers bureau training.
The Insys collection ultimately will contain several million documents that are currently being processed chronologically. Processed documents are being made public on a rolling basis with monthly releases expected in 2023–2024.
100 documents were added to the USRTK Food Industry collection today. These documents were acquired by US Right to Know (USRTK) during their ongoing investigations into the influence of large food and beverage companies on academic partnerships and government regulatory processes around sugary beverages and obesity, among other topics.
77 new depositions and trial testimony documents were added to the DATTA collection today. These transcripts were donated by Dr. Robert Proctor and come from his efforts as expert witness in tobacco and smokeless tobacco litigation.
The recording of the May 5th webinar is now available on the event page!
Many thanks to Cecilia Tomori and Jason Chernesky for MC'ing this amazing event and to our wonderful presenters, Keith Wailoo, Kate Tasker, Adam Koon, and Gaurab Bhardwaj.
Opioid Industry Documents Archive
The UCSF-JHU Opioid Industry Documents Archive added more than 100,000 documents to the Insys Litigation Documents collection. The documents are from 2013 and 2014 and document many aspects of Insys's business activities, ranging from sales tracking spreadsheets to speakers bureau training to Reimbursement Center emails.
This release brings the total number of public Insys documents to more than 1.5 million; the Insys collection ultimately will contain several million documents that are currently being processed chronologically. Processed documents are being made public on a rolling basis with monthly releases expected in 2023–2024.
Food Industry Documents Archive Update
735 documents were added to the USRTK collection today, bringing the total number of documents to over 41,000! These documents were acquired by US Right to Know (USRTK) during their ongoing investigations into the influence of large food and beverage companies on academic partnerships and government regulatory processes around sugary beverages and obesity, among other topics.
Registration open for the Annual Tobacco and Industry Documents Workshop
Friday, May 12, 2023
9:30am - 2:15pm (PT)
This workshop will be a virtual event.
The UCSF Center for Tobacco Control Research and Education (CTCRE) will once again hold it's annual one-day workshop on using tobacco and other industry documents for advocacy. This popular workshop highlights how lessons learned from tobacco industry documents also apply to chemical, food, drug and opiate industries.
This year, the annual Tobacco and Industry Documents Workshop is collaborating with the University of California, San Francisco (UCSF) Philip R. Lee Institute for Health Policy Studies and Breast Cancer Action to uplift the use of the Industry Documents Library for breast cancer advocacy, campaign development, and research collaborations.
------------------------------------------------------------
Join us on May 5, 2023 for Exploring the Opioid Industry Documents: Research Communities, Educational Opportunities, and Community Data. This event will feature a webinar where scholars will discuss how they successfully use OIDA and other Industry Documents Library (IDL) collections.
Webinar: May 5: 8am PT / 11 am ET.
The UCSF-JHU Opioid Industry Documents Archive (OIDA) has added another 224,322 documents to the Insys Litigation Documents collection.
This batch includes a large portion of documents from 2014, the year Insys began direct-shipping their fentanyl-based Subsys to C&R Pharmacy in Florida, a pill-mill owned by Drs. Couch and Ruan, two of the highest prescribers of Subsys in the United States.
This release is the sixth batch of Insys documents to be added to OIDA; the Insys collection ultimately will contain several million documents that are currently being processed chronologically. Processed documents will be made public on a rolling basis with monthly releases expected in 2023–2024.
Over 7,300 new documents were posted to the USRTK collection today. This set of documents was donated by US Right to Know (USRTK) and acquired during their ongoing investigations into the influence of large food and beverage companies on academic partnerships and government regulatory processes around sugary beverages and obesity, among other topics.
Annual Tobacco and Industry Documents Workshop
Friday, May 12, 2023
9:30am - 2:15pm (PT)
This workshop will be a virtual event.
The UCSF Center for Tobacco Control Research and Education (CTCRE) will once again hold it's annual one-day workshop on using tobacco and other industry documents for advocacy. This popular workshop highlights how lessons learned from tobacco industry documents also apply to chemical, food, drug and opiate industries.
This year, the annual Tobacco and Industry Documents Workshop is collaborating with the University of California, San Francisco (UCSF) Philip R. Lee Institute for Health Policy Studies and Breast Cancer Action to uplift the use of the Industry Documents Library for breast cancer advocacy, campaign development, and research collaborations.
More information and upcoming registration linkThis event will feature a webinar where scholars will discuss how they successfully use OIDA and other Industry Documents Library (IDL) collections. We will conduct a hands-on virtual workshop that will help researchers navigate and explore OIDA’s rich collections.
Webinar: May 5: 8 am - 9:45 am (PT) Free and open to the public - Zoom Link
Opioid Industry Documents Archive
We added 127,511 documents to the UCSF-JHU Opioid Industry Documents Archive's Insys Litigation Documents collection. These documents, which arise from Insys’s early years bringing the fentanyl spray Subsys to market (2012–2013), shed new light on the genesis of the company’s speaker program and reimbursement center (See the Insys At a Glance page for more information), both of which have featured prominently in litigation against Insys.
This release is the fourth batch of Insys documents to be added to OIDA; the Insys collection ultimately will contain several million documents that are currently being processed chronologically. Processed documents will be made public on a rolling basis with monthly releases expected in 2023–2024. Information arising from a December 2022 release (UCSF News, Johns Hopkins University News) served as the basis for reporting from USA Today.
Opioid Industry Documents Archive National Advisory Committee Update
We are pleased to welcome four new members to our National Advisory Committee, a group that supports the Archive through expert recommendations on the project’s development and sustainability pertaining to use, transparency, accessibility, impact, and other measures: Sandy Alexander (former Massachusetts Assistant Attorney General), Michelle Muffett-Lipinski (recovery advocate and Founding Principal, Northshore Recovery High School), Melina Sherman (communications scholar, Knology), and Anthony Ryan Hatch (Professor of the Science in Society Program, Wesleyan University). Many thanks to our outgoing NAC member Beth Macy (author of Raising Lazarus and Dopesick) for her remarkable service.
3,600+ New USRTK Food Industry Documents Added
The 3,634 new documents posted today were donated by USRTK and acquired in their ongoing investigations into the influence of large food and beverage companies on academic partnerships and government regulatory processes around sugary beverages and obesity, among other topics.
Postdoctoral Fellowship in Opioid Industry Documents Research and Community Data Engagement -
The UCSF OIDA Postdoctoral Fellow will pursue original, publishable research using materials housed in OIDA and work closely with the archive research team to enhance the accessibility and usability of archival materials for a diverse array of communities, with a particular focus on racial and health equity. Fellows will work on a multidisciplinary team including faculty, other postdoctoral fellows and research assistants and will be mentored by and work closely with researchers and information specialists at UCSF. Fellows will be based at the UCSF Center for Tobacco Control Research and Education (https://tobacco.ucsf.edu/) and participate fully in the fellowship program. Fellows will also be affiliated with the Department of Humanities and Social Sciences at the UCSF School of Medicine (https://humsci.ucsf.edu/).
Postdoctoral Fellowship in Tobacco Control Research -
The CTCRE Postdoctoral Fellowship offers diverse educational and research opportunities, including a grant writing seminar, graduate research positions, advocacy training, and individualized documents training. Work spans policy and historical research, economics, and science. Fellows are recruited from a variety of fields including the basic sciences, social sciences, public health practitioners, clinical fields, political science, history, economics, law, and marketing.
Fellowship stipends range from $55,500 - $66,600, depending on years of postdoctoral experience.
More about the fellowships and application submission
The Digital Health Humanities Pilot (DHHP) will facilitate new insights into historical health data. Participants from all disciplines (including faculty, staff, and other learners) will learn how to evaluate and integrate digital methods and “archives as data” into their research through a range of offerings and trainings utilizing datasets from holdings within the UCSF Archives and Special Collections (including the AIDS History Project and Industry Documents Library, among others.)
Check out the workshops and sign up!
UC Love Data Week (February 13-17)
Want more information on working with data?
The UC-wide Love Data Week offers free sessions on topics such as data access, management, security, sharing, and preservation.
As 2022 comes to a close, we’d like to say a big THANK YOU to all of you for your continuing support and connection to the Industry Documents Library.
We’re grateful for your interest in industry documents and for your participation in the IDL community, whether that’s through documents research, workshops and trainings, project partnerships, or strategic planning and guidance.
This year we celebrated 20 years (!!!) of making industry documents available online and we appreciate all the ways you’ve worked with us to make the IDL stronger.
Here are some of the achievements you helped us reach in 2022:
17,508,831 documents now available through IDL!
We added 2.3 million new documents to the collections in 2022 -
If you’re able, please consider making a tax-deductible donation to the Industry Documents Library to help us preserve and provide access to the collections for years to come.
From all of us at the IDL, we wish you a safe and festive holiday season, and a healthy and hopeful New Year ahead.
Kate, Rachel, Rebecca, Sven, Melissa and Erik