------------------- GENERAL INFORMATION ------------------- 1. Title of Dataset: Data from "2024 State of Open at the University of Colorado Boulder" Report 2. Authors: Ryan Caillet, Melissa H. Cantrell, Andrew Johnson, Matthew Murray, Aditya Ranganath 3. Contact information: Andrew Johnson, andrew.m.johnson@colorado.edu 4. Date of data collection: 2023-2024 -------------------------- SHARING/ACCESS INFORMATION -------------------------- 1. Licenses/restrictions placed on the data: CC-BY 2. Links to publications that cite or use the data: https://doi.org/10.25810/zvst-2g91 3. Recommended citation for the data: Caillet, R., Cantrell, M., Johnson, A., Murray, M., & Ranganath, A. (2024). Data from "2024 State of Open at the University of Colorado Boulder" Report [Data set]. University of Colorado Boulder. https://doi.org/10.25810/wzkx-pn69 --------------------- DATA & FILE OVERVIEW --------------------- 1. File List: A. Filename: CUBoulderOAFund2013_2023.csv B. Filename: CUBoulderOpenAlexAPCs_2023.csv C. Filename: CUBoulderPublishedData2014_2023.csv D. Filename: CUBoulderTotalOAPublishing2014_2023.csv E. Filename: CUScholarContent20231231.csv 2. Relationship between files: Each file was used to produce a section of the "2024 State of Open at the University of Colorado Boulder" report (https://doi.org/10.25810/zvst-2g91). -------------------------- METHODOLOGICAL INFORMATION -------------------------- This data set contains five data files that were used to produce the "2024 State of Open at the University of Colorado Boulder" report: 1. CUBoulderOAFund2013_2023.csv contains data from articles funded by the CU Boulder Libraries Open Access Fund from 2013 to 2023. This data was collected by CU Boulder Libraries personnel from successful applications to the Open Access Fund. 2. CUBoulderOpenAlexAPCs_2023.csv contains data on APCs paid for articles with CU Boulder authors published in 2023 from OpenAlex (https://openalex.org/). 3. CUBoulderPublishedData2014_2023.csv contains data from the CU Boulder Faculty Reports of Professional Activities from 2014 to 2023. CU Boulder Libraries personnel coded this data for the variables provided. 4. CUBoulderTotalOAPublishing2014_2023.csv contains data on type of open access article from Unpaywall (https://unpaywall.org/) matched against data on articles authored by CU Boulder faculty from CU Boulder Elements (https://www.colorado.edu/fis/CUBE). CU Boulder Libraries personnel exported the data provided on August 26, 2024. 5. CUScholarContent20231231.csv contains data on all of the items in the CU Scholar institutional repository as of December 31, 2023. This data was exported by CU Boulder Libraries personnel from the CU Scholar (Samvera) software on December 31, 2023. ----------------------------------------- DATA-SPECIFIC INFORMATION FOR: CUBoulderOAFund2013_2023.csv ----------------------------------------- 1. Number of variables: 6 2. Number of cases/rows: 411 3. Variable List: A. Name: Date Description: Date (MM/DD/YY) application to CU Boulder Libraries Open Access Fund was received. B. Name: Status Description: CU Boulder status (faculty, PhD student, staff, etc.) of applicant to CU Boulder Libraries Open Access Fund. C. Name: Department Description: Name of CU Boulder primary department or other affiliation of applicant to CU Boulder Libraries Open Access Fund. D. Name: Journal title Description: Name of journal in which applicant to CU Boulder Libraries Open Access Fund is publishing. E. Name: Publisher Description: Name of publisher of journal in which applicant to CU Boulder Libraries Open Access Fund is publishing. F. Name: Paid Amount Description: Amount CU Boulder Libraries Open Access Fund paid to cover the article processing charge (APC) for the applicant. 4. Data codes (e.g., N/A = Not applicable): CU Boulder department/unit acronyms can be found here: https://www.colorado.edu/bfa/resources/acronyms ----------------------------------------- DATA-SPECIFIC INFORMATION FOR: CUBoulderOpenAlexAPCs_2023.csv ----------------------------------------- 1. Number of variables: 4 2. Number of cases/rows: 2080 3. Variable List: A. Name: doi Description: Digital Object Identifier (DOI) for each article with a CU Boulder author in OpenAlex. B. Name: publisher Description: Publisher for each article with a CU Boulder author in OpenAlex. C. Name: open_access_status Description: Open access status (e.g., gold) for each article with a CU Boulder author in OpenAlex. D. Name: apc_paid Description: Estimated amount of APC paid for each article with a CU Boulder author in OpenAlex. ----------------------------------------- DATA-SPECIFIC INFORMATION FOR: CUBoulderPublishedData2014_2023 ----------------------------------------- 1. Number of variables: 6 2. Number of cases/rows: 503 3. Variable List: A. Name: Year Description: Year in which a published data set was reported on the Faculty Report of Professional Activities at CU Boulder. B. Name: Dept Description: Department/unit of individual who reported a published data set on the Faculty Report of Professional Activities at CU Boulder. C. Name: DOI - yes or no Description: Whether or not a DOI was included in the citation for a published data set on the Faculty Report of Professional Activities at CU Boulder. D. Name: URL - yes or no Description: Whether or not a URL (and not a DOI) was included in the citation for a published data set on the Faculty Report of Professional Activities at CU Boulder. E. Name: Repository Name Description: The name of a repository included in the citation for a published data set on the Faculty Report of Professional Activities at CU Boulder (if applicable). F. Name: Repository Type Description: Type of repository included in the citation for a published data set on the Faculty Report of Professional Activities at CU Boulder as categorized by CU Boulder Libraries personnel. Possible values are Domain, External General, or Institutional. 4. Data codes (e.g., N/A = Not applicable): N/A = Not applicable Domain = A repository that serves a community defined by domain or data type External General = A repository that serves all domains and is open to deposit by anyone Institutional = A repository that serves a single institution (e.g., a university) ----------------------------------------- DATA-SPECIFIC INFORMATION FOR: CUBoulderTotalOAPublishing2014_2023.csv ----------------------------------------- 1. Number of variables: 14 2. Number of cases/rows: 10 3. Variable List: A. Name: Year Description: Year in which article by CU Boulder faculty member was published. B. Name: Closed (n) Description: Number of closed access articles published by CU Boulder faculty in a given year. C. Name: Gold (n) Description: Number of Gold open access articles published by CU Boulder faculty in a given year. D. Name: Green (n) Description: Number of Green open access articles published by CU Boulder faculty in a given year. E. Name: Hybrid (n) Description: Number of Hybrid open access articles published by CU Boulder faculty in a given year. F. Name: Bronze (n) Description: Number of Bronze open access articles published by CU Boulder faculty in a given year. G. Name: Total OA (n) Description: Total number of all types of open access articles published by CU Boulder faculty in a given year. H. Name: Total (n) Description: Total number of all articles published by CU Boulder faculty in a given year. I. Name: Total OA (%) Description: Percentage of all types of open access articles published by CU Boulder faculty in a given year. J. Name: Gold (%) Description: Percentage of Gold open access articles published by CU Boulder faculty in a given year. K. Name: Green (%) Description: Percentage of Green open access articles published by CU Boulder faculty in a given year. L. Name: Hybrid (%) Description: Percentage of Hybrid open access articles published by CU Boulder faculty in a given year. M. Name: Bronze (%) Description: Percentage of Bronze open access articles published by CU Boulder faculty in a given year. N. Name: Closed (%) Description: Percentage of closed access articles published by CU Boulder faculty in a given year. 4. Data codes (e.g., N/A = Not applicable): Definitions of open access types (as defined in Piwowar H, Priem J, Larivière V, Alperin JP, Matthias L, Norlander B, Farley A, West J, Haustein S. 2018. The state of OA: a large-scale analysis of the prevalence and impact of Open Access articles. PeerJ 6:e4375 https://doi.org/10.7717/peerj.4375): "Gold: Published in an open-access journal that is indexed by the DOAJ. Green: Toll-access on the publisher page, but there is a free copy in an OA repository. Hybrid: Free under an open license in a toll-access journal. Bronze: Free to read on the publisher page, but without an clearly identifiable license. Closed: All other articles, including those shared only on an ASN or in Sci-Hub." ----------------------------------------- DATA-SPECIFIC INFORMATION FOR: CUScholarContent20231231.csv ----------------------------------------- 1. Number of variables: 4 2. Number of cases/rows: 17448 3. Variable List: A. Name: Title Description: Title of each item in the CU Scholar institutional repository. B. Name: Academic Affiliation Description: CU Boulder department(s) or unit(s) affiliated with each item in the CU Scholar institutional repository. C. Name: Resource Type Description: Type of content for each item in the CU Scholar institutional repository. D. Name: URL Description: URL for each item in the CU Scholar institutional repository.