README

This document presents the dataset in an easily understandable format to help you gain insight into its contents.

Citation metadata

DOI - https://doi.org/10.7910/DVN/SKP9IB

title

Development of an AI/ML-ready knee ultrasound dataset in a population-based cohort

grantNumber

grantNumberAgency: NIAMS

grantNumberValue: 3R01AR077060-03S1

dsDescription

dsDescriptionValue:

About this data

An ultrasound dataset for discovering ultrasound features associated with pain and radiographic change in knee osteoarthritis (KOA) is highly innovative and will be a major step forward for the field. These ultrasound images originate from the diverse and inclusive population-based Johnston County Health Study (JoCoHS). This dataset is designed to adhere to FAIR principles and was funded in part by an Administrative Supplement to Improve the AI/ML-Readiness of NIH-Supported Data (3R01AR077060-03S1).

dsDescriptionDate: 2024-09-16

dsDescriptionValue:

To begin learning about this dataset, visit our User Guide for an all-in-one document containing statistics and other details to help you work with the data.

dsDescriptionDate: 2024-09-16

publication

publicationCitation: Yerich NV, Alvarez C, Schwartz TA, Savage-Guin S, Renner JB, Bakewell CJ, Kohler MJ, Lin J, Samuels J, Nelson AE. A Standardized, Pragmatic Approach to Knee Ultrasound for Clinical Research in Osteoarthritis: The Johnston County Osteoarthritis Project. ACR Open Rheumatol. 2020 Jul;2(7):438-448. doi: 10.1002/acr2.11159. PMID: 32597564; PMCID: PMC7368135.

publicationIDType: pmid

publicationIDNumber: 32597564

publicationURL: https://doi.org/10.1002/acr2.11159

author

authorName: Nelson, Amanda

authorAffiliation: University of North Carolina at Chapel Hill

authorIdentifierScheme: ORCID

authorIdentifier: 0000-0002-9344-7877

dateOfCollection

dateOfCollectionStart: 2019-03-14

dateOfCollectionEnd: 2024-06-01

subject

Medicine, Health and Life Sciences

Dataset files

24 files currently in this dataset

|  | filename | directory | categories | description |
|---|---|---|---|---|
| 0 | README.notebook.public.ipynb | code | ['code', 'Jupyter', 'notebook'] | The purpose of this file is to provide documentation and a preview of the data as a starting point for learning about the dataset. This notebook also provides instructions for running a virtual computing environment to further analyze the data and related metadata. |
| 6 | _notebook.config.template.json | code | ['code', 'json'] | This file is used with the README.notebook.public.ipynb and contains the notebook variables. |
| 3 | _notebook.instructions.md | code | ['code', 'Jupyter', 'Markdown'] | This file is used with the README.notebook.public.ipynb and contains instructions on using the notebook. |
| 22 | _notebook_installer.py | code | ['code', 'Jupyter', 'notebook', 'Python script'] | The purpose of this file is to install any Python modules used by the Jupyter notebook that are not included in the Jupyter environment by default. This script is only useful to those wanting to run the code within the Jupyter notebook. |
| 21 | _notebook_worker.py | code | ['code', 'Jupyter', 'notebook', 'Python script'] | The purpose of this file is to provide the scripts that help describe the dataset within the Jupyter notebook. This script does most of the work behind the scenes for the Jupyter notebook. The code is kept separate so that the notebook is not bloated with code that some users may not be interested in; the notebook's primary purpose is to describe the dataset, not the code that generates the descriptive information. This script is only useful to those wanting to run the code within the Jupyter notebook. |
| 1 | example.us.image.png | data/image/example | ['data', 'example'] | This is an example ultrasound file that represents one of the images from the image archives and should not be used for analysis purposes. Images in this dataset are saved in lossless .png format. |
| 2 | imageArchive.11.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Right SUPRAPAT LONG ultrasound images (file count: 867). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 7 | imageArchive.12.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Right SUPRAPAT LONG CPD ultrasound images (file count: 867). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 8 | imageArchive.13.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Right SUPRAPAT TRANS 30 ultrasound images (file count: 866). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 9 | imageArchive.14.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Right MED LONG ultrasound images (file count: 867). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 10 | imageArchive.15.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Right LAT LONG ultrasound images (file count: 867). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 13 | imageArchive.16.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Right SUPRAPAT TRANS FLEX ultrasound images (file count: 865). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 14 | imageArchive.17.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Right POST TRANS ultrasound images (file count: 835). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 11 | imageArchive.21.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Left SUPRAPAT LONG ultrasound images (file count: 866). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 16 | imageArchive.22.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Left SUPRAPAT LONG CPD ultrasound images (file count: 866). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 12 | imageArchive.23.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Left SUPRAPAT TRANS 30 ultrasound images (file count: 866). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 17 | imageArchive.24.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Left MED LONG ultrasound images (file count: 866). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 18 | imageArchive.25.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Left LAT LONG ultrasound images (file count: 866). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 19 | imageArchive.26.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Left SUPRAPAT TRANS FLEX ultrasound images (file count: 865). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 15 | imageArchive.27.zip | data/image/ultrasound | ['data', 'image'] | This file contains an archive of Left POST TRANS ultrasound images (file count: 834). There should only be one image per subject in this archive. Images are saved in lossless .png format. |
| 5 | dataTable.IMAGE_REF.tab | data/reference | ['data', 'reference'] | This file contains the ultrasound image metadata. Use this file to determine the number of ultrasound image types available, file sizes, and references to the files found in the ancillary image archives. |
| 4 | dataTable.SUBJECT.tab | data/reference | ['data', 'reference'] | This is the first file generated for this dataset and contains the participant/subject reference IDs, basic subject demographics, PA knee KL grades, and reported knee pain. |
| 20 | dvDatasetMetadata.json | data/reference | ['data', 'reference'] | This file contains metadata created BEFORE data was uploaded to the repository dataset to help describe the data we EXPECT to have uploaded to the dataset. This file was created because the repository does not contain built-in tools to describe data files and variables in depth, which is needed to both document the data and perform validation. This file is used to validate and describe categorical variables within the data files. |
| 23 | curation_log.md | documentation | ['curation'] | This document contains the dataset curation checklist and notes regarding the considerations and steps taken to curate the data. |

24 files EXPECTED in this dataset
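
The image archive names in the file list appear to follow a two-digit code: the first digit distinguishes right (1) from left (2) knees, and the second digit selects one of seven views. The sketch below decodes an archive name into its image type label. The mapping is read off the file descriptions above rather than from any documented specification, and the function name describe_archive is hypothetical.

```python
import re

# Side and view codes inferred from the archive descriptions in the file list
# (assumption: the two-digit code is <side digit><view digit>).
SIDES = {"1": "Right", "2": "Left"}
VIEWS = {
    "1": "SUPRAPAT LONG",
    "2": "SUPRAPAT LONG CPD",
    "3": "SUPRAPAT TRANS 30",
    "4": "MED LONG",
    "5": "LAT LONG",
    "6": "SUPRAPAT TRANS FLEX",
    "7": "POST TRANS",
}


def describe_archive(filename: str) -> str:
    """Return the image type label for a name like 'imageArchive.23.zip'."""
    match = re.fullmatch(r"imageArchive\.([12])([1-7])\.zip", filename)
    if match is None:
        raise ValueError(f"not a recognised image archive name: {filename!r}")
    side, view = match.groups()
    return f"{SIDES[side]} {VIEWS[view]}"


print(describe_archive("imageArchive.23.zip"))  # Left SUPRAPAT TRANS 30
```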

Datatable preview for dataTable.SUBJECT.csv

|  | E03SUBJECTID | E03GENDER | E03PASKR | E03PASKL | E03RADRPAKKL | E03RADLPAKKL | E03AGE |
|---|---|---|---|---|---|---|---|
| 0 | 72bb0a51-f020-11ed-b527-0a580a5f736a | Female | Moderate | None | Moderate OA | Moderate OA | 50-54 |
| 1 | 72bb0a76-f020-11ed-b527-0a580a5f736a | Female | Severe | None | Mild OA | nan | 60-64 |
| 2 | 72bb0a8a-f020-11ed-b527-0a580a5f736a | Female | Moderate | Moderate | Questionable OA | Mild OA | 65-70 |
| 3 | 72bb0a99-f020-11ed-b527-0a580a5f736a | Female | None | Mild | No OA | Questionable OA | 45-49 |
| 4 | 72bb0ab5-f020-11ed-b527-0a580a5f736a | Male | None | None | Questionable OA | Mild OA | 40-44 |
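
The file list gives this table as dataTable.SUBJECT.tab. As a minimal sketch, assuming the file is tab-delimited with a header row and that missing values appear as the literal text nan (as in the preview), the table could be read with the Python standard library as follows. The helper name read_subject_rows is hypothetical, and the sample rows are copied from the preview above.

```python
import csv
import io


def read_subject_rows(handle):
    """Parse a tab-delimited SUBJECT table, turning 'nan' and '' into None."""
    reader = csv.DictReader(handle, delimiter="\t")
    rows = []
    for row in reader:
        rows.append({k: (None if v in ("", "nan") else v) for k, v in row.items()})
    return rows


# Two rows copied from the preview, tab-delimited for illustration only.
sample = (
    "E03SUBJECTID\tE03GENDER\tE03PASKR\tE03PASKL\tE03RADRPAKKL\tE03RADLPAKKL\tE03AGE\n"
    "72bb0a51-f020-11ed-b527-0a580a5f736a\tFemale\tModerate\tNone\tModerate OA\tModerate OA\t50-54\n"
    "72bb0a76-f020-11ed-b527-0a580a5f736a\tFemale\tSevere\tNone\tMild OA\tnan\t60-64\n"
)
rows = read_subject_rows(io.StringIO(sample))
print(rows[1]["E03RADLPAKKL"])  # None
```

Note that None in the pain columns (E03PASKR, E03PASKL) is a real category meaning no reported pain, so the sketch keeps it as the string "None" and only treats nan or an empty field as missing. To read the downloaded file instead of the inline sample, pass a handle opened with open("dataTable.SUBJECT.tab", newline="").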

General statistics for file dataTable.SUBJECT.csv

E03GENDER

|  | category | counts |
|---|---|---|
| 0 | Female | 587 |
| 1 | Male | 294 |

E03PASKR

|  | category | counts |
|---|---|---|
| 0 | None | 399 |
| 1 | Mild | 196 |
| 2 | Moderate | 181 |
| 3 | Severe | 103 |
| 4 | No response | 2 |

E03PASKL

|  | category | counts |
|---|---|---|
| 0 | None | 448 |
| 1 | Moderate | 168 |
| 2 | Mild | 168 |
| 3 | Severe | 95 |
| 4 | No response | 2 |

E03RADRPAKKL

|  | category | counts |
|---|---|---|
| 0 | No OA | 303 |
| 1 | Questionable OA | 290 |
| 2 | Mild OA | 119 |
| 3 | Moderate OA | 89 |
| 4 | Severe OA | 63 |
| 5 | Total joint replacement | 2 |

E03RADLPAKKL

|  | category | counts |
|---|---|---|
| 0 | No OA | 339 |
| 1 | Questionable OA | 262 |
| 2 | Mild OA | 120 |
| 3 | Moderate OA | 87 |
| 4 | Severe OA | 55 |
| 5 | Total joint replacement | 3 |
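
The radiographic categories are ordered by severity, so for modelling it may be convenient to encode them as integers. The sketch below assumes the five OA labels correspond to Kellgren-Lawrence grades 0 to 4 in the usual order; this README does not state that mapping, so confirm it against dvDatasetMetadata.json before relying on it. Joint replacements and missing values are left unencoded, and the name encode_kl is hypothetical.

```python
# Assumed label-to-grade mapping (standard 0-4 Kellgren-Lawrence ordering);
# not stated in this README, so treat it as an illustration only.
KL_GRADE = {
    "No OA": 0,
    "Questionable OA": 1,
    "Mild OA": 2,
    "Moderate OA": 3,
    "Severe OA": 4,
}


def encode_kl(label):
    """Return the ordinal grade, or None for missing values and joint replacements."""
    return KL_GRADE.get(label)


# Left-knee counts copied from the E03RADLPAKKL table above.
left_counts = {
    "No OA": 339,
    "Questionable OA": 262,
    "Mild OA": 120,
    "Moderate OA": 87,
    "Severe OA": 55,
    "Total joint replacement": 3,
}
# Number of left knees that receive an ordinal grade under this mapping.
graded = sum(n for label, n in left_counts.items() if encode_kl(label) is not None)
print(graded)  # 863
```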

E03AGE

|  | category | counts |
|---|---|---|
| 0 | 65-70 | 171 |
| 1 | 60-64 | 153 |
| 2 | 55-59 | 149 |
| 3 | 50-54 | 144 |
| 4 | 40-44 | 104 |
| 5 | 45-49 | 102 |
| 6 | 35-39 | 58 |

Datatable preview for dataTable.IMAGE_REF.csv

|  | E03SUBJECTID | E03USIMGT | E03USIMGF | E03USIMGZ | E03USIMGD |
|---|---|---|---|---|---|
| 0 | 72bb0a51-f020-11ed-b527-0a580a5f736a | Left SUPRAPAT LONG | 72bb0a51-f020-11ed-b527-0a580a5f736a_21.png | 132460 | Left |
| 1 | 72bb0a51-f020-11ed-b527-0a580a5f736a | Left SUPRAPAT LONG CPD | 72bb0a51-f020-11ed-b527-0a580a5f736a_22.png | 154179 | Left |
| 2 | 72bb0a51-f020-11ed-b527-0a580a5f736a | Left SUPRAPAT TRANS 30 | 72bb0a51-f020-11ed-b527-0a580a5f736a_23.png | 133222 | Left |
| 3 | 72bb0a51-f020-11ed-b527-0a580a5f736a | Left MED LONG | 72bb0a51-f020-11ed-b527-0a580a5f736a_24.png | 121758 | Left |
| 4 | 72bb0a51-f020-11ed-b527-0a580a5f736a | Left LAT LONG | 72bb0a51-f020-11ed-b527-0a580a5f736a_25.png | 108043 | Left |
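
In the preview rows, each E03USIMGF value has the form subject ID, underscore, two-digit code, .png, and the code matches the number in the corresponding archive name (for example, _21.png is a Left SUPRAPAT LONG image, as is imageArchive.21.zip). Assuming this pattern holds for every row, which the preview alone does not prove, the archive expected to hold a given image could be derived as sketched below. The function name archive_for_image is hypothetical.

```python
import re


def archive_for_image(image_file: str) -> str:
    """Map '<subject id>_<code>.png' to the archive expected to contain it."""
    match = re.fullmatch(
        r"(?P<subject>[0-9a-f-]{36})_(?P<code>[12][1-7])\.png", image_file
    )
    if match is None:
        raise ValueError(f"unexpected image file name: {image_file!r}")
    return f"imageArchive.{match['code']}.zip"


name = "72bb0a51-f020-11ed-b527-0a580a5f736a_24.png"
print(archive_for_image(name))  # imageArchive.24.zip
```

With the archive downloaded, zipfile.ZipFile(archive_for_image(name)).read(name) would return the PNG bytes, provided the images are stored at the top level of each archive; that layout is also an assumption.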

General statistics for file dataTable.IMAGE_REF.csv

E03USIMGT

|  | category | counts |
|---|---|---|
| 0 | Right SUPRAPAT LONG | 867 |
| 1 | Right SUPRAPAT LONG CPD | 867 |
| 2 | Right LAT LONG | 867 |
| 3 | Right MED LONG | 867 |
| 4 | Left SUPRAPAT LONG | 866 |
| 5 | Left SUPRAPAT LONG CPD | 866 |
| 6 | Left SUPRAPAT TRANS 30 | 866 |
| 7 | Left MED LONG | 866 |
| 8 | Left LAT LONG | 866 |
| 9 | Right SUPRAPAT TRANS 30 | 866 |
| 10 | Left SUPRAPAT TRANS FLEX | 865 |
| 11 | Right SUPRAPAT TRANS FLEX | 865 |
| 12 | Right POST TRANS | 835 |
| 13 | Left POST TRANS | 834 |

E03USIMGD

|  | category | counts |
|---|---|---|
| 0 | Right | 6034 |
| 1 | Left | 6029 |
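
As a quick consistency check, the per-type counts in E03USIMGT should add up to the per-side totals in E03USIMGD. The sketch below redoes that arithmetic with the numbers copied from the two tables above; it assumes only that the first word of each image type names the side.

```python
# Per-type counts copied from the E03USIMGT table above.
type_counts = {
    "Right SUPRAPAT LONG": 867,
    "Right SUPRAPAT LONG CPD": 867,
    "Right LAT LONG": 867,
    "Right MED LONG": 867,
    "Left SUPRAPAT LONG": 866,
    "Left SUPRAPAT LONG CPD": 866,
    "Left SUPRAPAT TRANS 30": 866,
    "Left MED LONG": 866,
    "Left LAT LONG": 866,
    "Right SUPRAPAT TRANS 30": 866,
    "Left SUPRAPAT TRANS FLEX": 865,
    "Right SUPRAPAT TRANS FLEX": 865,
    "Right POST TRANS": 835,
    "Left POST TRANS": 834,
}

# Sum the counts by side, taking the side from the first word of each label.
side_totals = {"Right": 0, "Left": 0}
for image_type, n in type_counts.items():
    side = image_type.split(" ", 1)[0]
    side_totals[side] += n

print(side_totals)  # {'Right': 6034, 'Left': 6029}
```

Both totals match the E03USIMGD table, giving 12063 image references overall.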