Metadata and Images from the OIDA Image Collection

Introduction

The OIDA Image Collection highlights images extracted from documents created by the opioid industry. Many of these documents were designed for internal company audiences and board members, while others were targeted to prescribers and consumers. The images provide insight into corporate practices that shaped the opioid crisis.

Images from OIDA provide unique entry points to understand a visual narrative of the opioid industry and gain insight into harmful corporate and marketing practices that contributed to the opioid crisis. Metadata are also included. OIDA staff at UCSF and Johns Hopkins generated some of the image descriptions, types, and other metadata using artificial intelligence (AI) models, with human experts revising some of these values. Some metadata, such as image categories (e.g., Addiction, Public Relations, Regulatory) and titles were created by human experts. Other manually created metadata about images may be added in the future.

Download the data files

You can access the full set of images featured in the OIDA Image Collection, plus descriptive metadata (described below) that includes references to source documents, in AWS:

Download full set of images, ZIP file (1.4 GB): image_collection_version_1.zip

Download metadata, GZipped CSV (1.7 MB): oida-image-collection-metadata-version-1.csv.gz

Block Featured Image

How to cite this data product

UCSF-JHU Opioid Industry Documents Archive (2024). Metadata and Images from the OIDA Image Collection. Available at https://doi.org/10.26144/ps14-tv04.

Dataset field descriptions

Data Element Sample Data Description
filename 02f56dc0-dfa3-4967-a4e5-10bd7cda4dfb.png The image filename: the image_id plus file extension
image_id 02f56dc0-dfa3-4967-a4e5-10bd7cda4dfb A Universally Unique Identifier (UUID)
title Mallinckrodt launches generic version of Concerta in U.S. (press release, dated December 31, 2012) The title of the image, if one is provided. Most often, this is a cataloger-supplied title.
description_caption The image is a photograph of a newspaper article titled "Mallinckrodt Launches Generic Version of CONCERT® in U.S. Company to hold six-month exclusivity on 27, 36 and 54 milligram dosage strengths of ADHD treatment. St. Louis, December 31, 2012. The article is dated December 31st, 2012 and is published by the Pharmaceuticals Business of Covidien. The article mentions that the company has received approval from the U. S. Food and Drug Administration (FDA) to manufacture and market a generic version of CONCERTRA® (Methylphenidate HCI) Extended-Release (ER) Tablets. The company will launch Methylphenidine HCI tablets in the 27 mg dosage strength immediately. A brief description of the image. See the associated field captioned_by_ai for a flag on whether this value was generated by AI or not.
captioned_by_ai 1 A flag indicating whether the value in the description_caption field is generated by an AI model (1) or not (0).
ocr_text_extracted Mallinckrodt Launches Generic Version Of CONCERTAS in US: Company to hold six-month exclusivity on 27, 36 and 54 milligram dosage strengths of ADHD treatment. ST. LOUIS December 31, 2012 Mallinckrodt; the Pharmaceuticals business of Covidien (NYSE: COV); today announced that it has received approval from the U.S. Food and Drug Administration (FDA) to manufacture and market a generic version of CONCERTA@ (methylphenidate HCI) Extended-Release (ER) Tablets USP (CII) in 27 , 36 and 54 mg dosage strengths: The company will launch Methylphenidate HCI ER Tablets in the 27 mg dosage strength immediately. The text in the image as extracted by an OCR (Optical Character Recognition) model.
image_type Advertisement Controlled vocabulary field. One of the following values: Graph or Chart, Table, Map, Photograph, Advertisement, or Illustration
type_classified_by_ai 1 A flag indicating whether the value in the image_type field is generated by an AI model (1) or not (0).
category Public Relations A category for the content of the image. Most often assigned by catalogers. One of the following values: Addiction, Business Strategy, Cartoons, Historical Images, Memes and Pop Culture, Outreach and Education, Packaging and Labeling, Pain Management, Pharma Employees, Public Relations, Regulatory, or Sales and Marketing
category_classified_by_ai 0 A flag indicating whether the value in the category field is generated by an AI model (1) or not (0).
topics   One or more Library of Congress Subject Headings; Multiple values delimited by a pipe (|).
drug   If the image relates to one or more particular drugs they are noted here. Brand names and generic names may appear. Generic name(s) are noted in our controlled vocabulary (XLSX). Multiple values delimited by a pipe (|).
keywords   One or more labels related to the image; pipe-delimited if there are multiple values.
creation_date 4/4/24 2:59 Timestamp for the creation of this record.
date_modified 4/4/24 2:59 Timestamp marking the last update to this record.
urls https://www.industrydocuments.ucsf.edu/opioids/docs/#id=pndp0242|https://www.industrydocuments.ucsf.edu/opioids/docs/#id=zrdp0242 Hyperlink of the particular OIDA parent document in the Industry Documents Library; pipe-delimited when the image appears in more than one OIDA document.
collections Mallinckrodt Litigation Documents|Mallinckrodt Litigation Documents The OIDA collection name for this particular parent document; pipe-delimited when the image appears in more than one OIDA document.
years 2013|2013 The year the particular OIDA parent document was dated; pipe-delimited when the image appears in more than one OIDA document.