Predicting COVID-19 In Chest X-Ray Images - Stanford University

1y ago

31 Views

2 Downloads

8.62 MB

10 Pages

Last View : Today

Last Download : 3m ago

Upload by : Sasha Niles

Report this link

Download PDF

Transcription

Predicting COVID-19 in Chest X-Ray ImagesHealthcare/Computer VisionYash ManiyarComputer ScienceStanford Universityymaniyar@stanford.eduMadhu KarraComputer ScienceStanford Universitymkarra@stanford.eduArvind SubramanianComputer ScienceStanford Universityarvindvs@stanford.edu1Problem DescriptionMost existing COVID-19 tests use nasal swabs and a polymerase chain reaction to detect the virus in a sample. We aim todevelop an alternative, computer vision based method of identifying whether or not a patient is infected with COVID-19, viralpneumonia, or neither based on an X-ray image of their chest. We hope that such a model will expand access to quick, accuratediagnoses of COVID-19, and that the architecture we produce may be able to be re-purposed to detect other lung conditions.22.1Dataset and PreprocessingDatasetThe dataset used in this project is composed of images from two separate chest x-ray datasets. Together, these images constitute16 classes: 15 disease classes and 1 normal class.The COVID-19 Radiography Database [2] is a collection of about 4000 chest X-ray images, each labeled as one of three classes:COVID-19, viral pneumonia, or “normal" (neither COVID-19 nor viral pneumonia). We used this dataset for the first part of ourproject.(a) Normal(b) COVID-19(c) Viral PneumoniaFigure 1: COVID Chest X-ray dataset split into 3 classesCS230: Deep Learning, Winter 2021, Stanford University, CA. (LateX template borrowed from NIPS 2017.)

The NIH Chest X-Ray Dataset [6,7] is a collection of approximately 122,000 chest X-ray images, each labeled as one of 15classes. We used this dataset in the second part of our project.(a) Normal(b) Pneumo- (c) Atelectasisnia(i) Fibrosis(j) Effusion(d) Consolida- (e) y(f) Pneumothorax(m) Nodule(g) Edema(n) Mass(h) Emphysema(o) HerniaFigure 2: NIH Chest X-ray dataset split into 15 classesWe create a novel dataset for our project by sampling approximately 1000 images from each dataset. The total number of imagesfrom each class are shown in Table 1.Table 1: Dataset classes EffusionPleural 456789101112131415# of 0100010001000110PreprocessingSince the X-ray images are square and of varying resolution, we standardized the resolution to 256 256 pixels. Afterwards,each pixel value was normalized to be between 0 and 1. Additionally, we convert each image to grayscale. Finally, we subtractthe mean value of each image’s pixels then divide by the standard deviation.We split the dataset into training, validation, and test sets. The training set used 98% of the data, the validation set used 1% of thedata, and the test set used 1% of the data.We created a dataloader to efficiently load the training and validation datasets, randomly shuffle the data, and batch the data.The dataloader performs the requisite transforms (resize to 256 256, scale pixel values between 0 and 1) when loading theparticular batch to be used. We use a batch size of 32 for most of our models.2

3Experiments3.1Baseline: 4-Layer Conv NetworkAs a starting point, we wrote a 4 layer convolutional neural network with Dropout. Full details of this network are given in Table2 in the Appendix. We trained this model separately on the COVID Radiography dataset and the merged dataset.3.2ResNet-Inspired NetworksIn an attempt to capture the complexity of the merged, 16-class dataset, we drew inspiration from ResNet [8], a popular modelused for multi-class image classification. We wrote two versions of a “Conv-Skip Block" (A and B), whose exact structureis defined in Tables 3 and 4 in the Appendix. The basic different between these blocks is that Conv-Skip Block A has threeconvolutional layers, and adds the identity of the input to the later activation, while Conv-Skip Block B has only two convolutionallayers: one has a user-defined number (o) of 3x3 filters, while the other is a simple 1x1 convolution done on the identity of theinput, to be added to the later activation. Both Blocks use Dropout and Batch Normalization.3.2.1Model AOur first network (defined in Table 5 in the Appendix) stacks three Conv-Skip Block As, followed by fully connected layers.This model used no Dropout (p 0) and was trained on the merged, 16-class dataset.3.2.2Model BIn an attempt to counter over-fitting, Model B (Table 6 in the Appendix) has a very similar overall architecture as Model A (withone extra convolutional layer at the beginning), but with Dropout. This model was also trained on the merged, 16-class dataset.3.2.3Model CTo further reduce over-fitting, Model C (Table 7 in the Appendix) removed uses Conv-Skip Block B instead of A (making for aslightly smaller model). Dropout was also made more aggressive in this model; we increased drop probability. Model C wastrained separately on both the merged 16-class dataset, and the smaller 3-class COVID Radiography dataset.4Results and Analysis4.1Trained on COVID-19 Radiography DatasetOur first task was to train our models on the full set of images provided in the radiography 3-class dataset. The task was toclassify X-ray images as either normal, pneumoniac or COVID lungs. For this dataset, we used a suite of CNNs with differentvariations and techniques, as described in the following sections. We split our data into train, test, and validation splits, each witha roughly even distribution of labels (we chose a 98-1-1 split).4.1.1Baseline: 4-Layer CNNOur 4-layer baseline network converged quickly (Figure 6, Appendix); after training for 10 epochs, we observed a test-setaccuracy of 90% and a validation-set accuracy of 92% Given that this relatively simple model performed so well, we tryModel C, a more complex model, on the same dataset to see if we can boost performance.4.1.2Model CAfter running Model C for 15 epochs, we observe a test-set accuracy of 92% and a validation-set accuracy of 91%, slightlyoutperforming our 4-layer conv network. We further experimented with learning rate and hidden sizes, although none of theseexperiments yielded significant improvements in performance.4.2Trained on Novel Expanded Dataset: NIH Chest X-RayAfter exhausting our models’ capabilities on the 3-class radiography dataset, we increase the complexity of our task by including16 classes (see Dataset section for a description).3

4.2.1Model AWe first train model A, heavily inspired by the ResNet architecture, on the 16-class dataset. We observe dramatic over-fitting,with our train accuracy reaching a peak of 92% after 20 epochs, but a validation accuracy that never exceeded 28%. Figure 3shows the learning curves; we see a steadily decreasing training loss alongside a monotonically increasing validation loss. Forour next set of experiments, we decide to reduce the complexity of the model and introduce various regularization techniques inan attempt to combat the observed over-fitting problem.(a) Train loss(b) Validation lossFigure 3: Learning Curves from Model A on 16-class dataset4.2.2Model BWe train Model B, hoping it might reduce over-fitting by adding a penalty to the loss function through dropout regularization.After 15 epochs, we observed a training-set accuracy of 73% with a validation accuracy of 19%. Figure 7 in the Appendixshows the learning curves; we see over-fitting once again, with worsened performance on both data splits, when compared toModel A.4.2.3Model CWe test out a much smaller and less-complicated model, Model C, on the same dataset and observe results. After 8 epochs,we observed a train accuracy of 65% and a test-set accuracy of 30%. Although the discrepancy between train and validationaccuracies was smaller than those observed with Model A and Model B, it is clear that Model C has a harder time picking upfeatures of the dataset, likely due to the smaller number of learnable parameters. See Figure 4 for learning curves from Model C.We treat Model C as our best model for this task, since it achieved the highest accuracy on unseen data.(a) Train loss(b) Validation lossFigure 4: Learning Curves from Model C on 16-class dataset4

4.3AnalysisOur final metrics on the 16-class extended task were much worse than our metrics for the 3-class dataset, showing us that theextended dataset presents a much more challenging classification task. We saw dramatic over-fitting with more complex models,and a lack of generalizability and sometimes under-fitting with the less-complicated models that we tried. These results suggestthat a model, given more and more learnable parameters, tends to memorize this data rather than pick up on generalizablepatterns. Since the 3-class version of this task seemed to be have a much more learnable objective, we hypothesize that some ofthe classes introduced in the NIH dataset are harder to differentiate from one another than the classes present in the Covid-19Chest X-ray dataset (pneumonia, covid, healthy). To test this hypothesis, we generate a confusion matrix for Model C on the testset (see Figure 5 below). We see that our model was did a good job of detecting COVID lungs, healthy lungs, and pneumonialungs, which were the three classes present in the Covid-19 Chest X-ray dataset. The model had a much more difficult timediscriminating between the other lung diseases introduced in the NIH dataset. As a reference, we ran a pre-trained ResNet-18model, and we observed the same trends (over-fitting to training dataset with poor performance on unseen data).To further understand the errors our best model made on the 16-class data, we calculated the percentage of test examples whosecorrect class was within the top 3 from our predictions. It was good to see that although Model C achieved a test accuracy of30%, around 53% of test examples had the correct label in the top 3 classes in our output.Upon looking through the NIH dataset, it appears that certain diseases have a lot of variation in the images corresponding tothat label. It’s possible that some of these diseases manifest differently in different patients, leading to higher variation in chestX-ray features (see Figure 8 for example images for cardiomegaly). Furthermore, the labels for this dataset were created throughan NLP bootstrapping algorithm; although the expected accuracy is said to be higher than 90%, perhaps noise introduced bymislabeled examples makes it harder for models to learn an objective function.Figure 5: Confusion Matrix (y true label, x prediction)5

References[1] Pasa, F., et al. "Efficient deep network architectures for fast chest X-ray tuberculosis screening and visualization." Scientific reports 9.1(2019): 1-9.[2] Chowdhury, Muhammad EH, et al. "Can AI help in screening viral and COVID-19 pneumonia?." IEEE Access 8 (2020): 132665-132676.[3] Hemdan, Ezz El-Din, Marwa A. Shouman, and Mohamed Esmail Karar. "Covidx-net: A framework of deep learning classifiers to diagnosecovid-19 in x-ray images." arXiv preprint arXiv:2003.11055 (2020).[4] Zhang, J., et al. "Viral pneumonia screening on chest x-ray images using confidence-aware anomaly detection (2020)." arXiv preprintarXiv:2003.12338.[5] Abbas, Asmaa, Mohammed M. Abdelsamea, and Mohamed Medhat Gaber. "Classification of COVID-19 in chest X-ray images usingDeTraC deep convolutional neural network." Applied Intelligence (2020): 1-11.[6] Wang, X., et al. "Hospital-scale chest x-ray database and benchmarks on weakly-supervised classification and localization of commonthorax diseases." IEEE CVPR. 2017.[7] lable-chest-x-ray-datasets-scientificcommunity[8] K. He, X. Zhang, S. Ren and J. Sun, "Deep Residual Learning for Image Recognition," 2016 IEEE Conference on Computer Vision andPattern Recognition (CVPR), Las Vegas, NV, USA, 2016, pp. 770-778, doi: 10.1109/CVPR.2016.90.6

5AppendixGithub repo: e 2: 4-layer CNN ArchitectureLayernum. output featuressize of outputkernel sizeInputConv2D C1MaxPool2DReLUDropout(p 0.1)Conv2D C2MaxPool2DReLUDropout(p 0.2)Conv2D C3MaxPool2DReLUDropout(p 0.5)Conv2D C4MaxPool2DReLUFlattenLinear FC1ReLULinear FC2ReLULinear 1256 x 256256 x 256128 x 128128 x 128128 x 128128 x 12864 x 6464 x 6464 x 6464 x 6432 x 3232 x 3232 x 3232 x 3216 x 1616 x –3x32x2––3 x 532x2––––––––Table 3: Conv-Skip Block A (i, h, o, p)Layernum. output featuressize of outputkernel sizeInputConv2D C1BatchnormReLUConv2D C2Add InputReLUConv2D xNNxNNxNNxNNxNN/2 x N/2–3x3––3x3––3x3–––7

Table 4: Conv-Skip Block B (i, o, p)Layernum. output featuressize of outputkernel sizeInputConv2D C1BatchnormReLUConv2D C2(Input)Add NNxNNxNNxNNxNN/2 x N/2–3x3––1x1––––Table 5: Model ALayernum. output featuressize of outputkernel sizeInputConv-SkipA(h 16, p 0.0) SB1Conv-SkipA(h 16, p 0.0) SB2Conv-SkipA(h 16, p 0.0) SB3FlattenLinear FC1ReLULinear FC2Softmax132643211111256 x 256128 x 12864 x 6432 x um. output featuressize of outputkernel sizeInputConv2D C1ReLUConv-SkipA(h 16, p 0.1) SB1Conv-SkipA(h 32, p 0.2) SB2Conv-SkipA(h 64, p 0.3) SB3FlattenLinear FC1ReLULinear FC2Softmax116166412812811111256 x 256256 x 256256 x 256128 x 12864 x 6432 x Table 6: Model BTable 7: Model CLayernum. output featuressize of outputkernel sizeInputConv2D C1MaxPool2DReLUConv-SkipB(p 0.1) SB1Conv-SkipB(p 0.3) SB2Conv-SkipB(p 0.6) SB3FlattenLinear FC1ReLULinear FC2Softmax1161616326412811111256 x 256256 x 256128 x 128128 x 12864 x 6432 x 3216 x –8

(a) Train loss(b) Validation lossFigure 6: Learning Curves on 4-layer Conv Network on 3-class dataset(a) Train loss(b) Validation lossFigure 7: Learning Curves from Model B on 16-class dataset9

(a) Cardiomegaly 1(b) Cardiomegaly 2Figure 8: COVID Chest X-ray dataset split into 3 classes10(c) Cardiomegaly 3

Computer Science Stanford University ymaniyar@stanford.edu Madhu Karra Computer Science Stanford University mkarra@stanford.edu Arvind Subramanian Computer Science Stanford University arvindvs@stanford.edu 1 Problem Description Most existing COVID-19 tests use nasal swabs and a polymerase chain reaction to detect the virus in a sample. We aim to

Related Documents:

CT Protocols

Chest o Chest/Abdomen/Pelvis (Routine) 43 o Chest with or without contrast (Routine) 44 o High Resolution Chest 45 o Chest Angio Protocol (PE) 46 o Coronary Artery Calcium Score 47 o Cardiac (Heart) Score (coronary artery or pulmonary vein) 48 Overread (Addendum) o Aortic Dissection – Chest w/o & CTA

24 Views

2y ago

University Radiology To Order a CT or CTA study call 305 ...

CT Chest Asbestosis (High Resolution) Fibrosis CT Chest Without CT Chest Without Contrast --- HIGH RESOLUTION 71250 Contrast Lung Cancer Screening (Low Dose Protocol) CT Chest Without Contrast 71250 CTA Chest (PE Study) Chest Pain/Dyspnea DVT Tachypnea Hemoptysis Shortness of Bre

25 Views

2y ago

Chest Radiography for Radiologic Technologists - Idaho State University

Evaluate a chest radiograph for various devices such as endotracheal tubes, chest tubes and central venous catheters. Describe several pathologies of the chest. The chest exam is performed more frequently than any other exam in the imaging department. It is important for radiographers to under-stand the standards for imaging the chest because

16 Views

1y ago

CPG Chest Pain EN PDF

Chest Pain 4. Introduction 4.1. ESSENTIALS Chest pain complaints are of common occurrence in medical practice. Chest pain frightens the patient and puts the physician on the alert, as it is often a symptom of a serious disease. From the diagnostic standpoint, chest pain may present a real challenge to the physician.

29 Views

3y ago

Chest Pain [Read-Only] - University of Washington

Chest Pain Objectives Discuss a ggpp peneral approach to chest pain Review differential diagnosis Develop an understanding of the diagnosis and management of common and serious causes ofmanagement of common and serious causes of chest pain Background Chest

33 Views

2y ago

Using the Atrium Oasis Closed Chest Drainage System

Using the Atrium Oasis Closed Chest Drain Receiving the Patient with a Chest Drain Secure the unit below the level of chest tube/ chest drain insertion. Keep the unit in an upright position. Two hanging hooks will fold out of the carry handle of t

23 Views

2y ago

Chest Mobilization Techniques for Improving …

Chest mo bilization is one of many techniques and very important in conventional chest physical therapy for increasing chest wall mobility and improving ventilation (Jennifer & Prasad, 2008). Ei ther passive or active chest mobil

12 Views

2y ago

COVID-19 OBSSR Research Tools

COVID-19 Mental health impact COVID-19 Impact on Sleep COVID-19 Positive Impacts University of California, San Dr. Ariel J. Lang ajlang@health.ucsd.edu ID: 21877 COVID-19 Household Environment Scale (CHES) - English COVID-19 Household Environment Scale (CHES) - Spanish COVID-19 Social Distancing and Symptoms COVID-19 on Family .

104 Views

3y ago

Recent Views

Chapter 15 Rooming Houses - MassLegalHelp

Individual renters usually have their own separate room and their own agreement with the landlord. For example, you may stay for just a few days, but another renter may stay for 3 months. Rooming houses with 4 or more renters at the same time must be licensed. Some cities and towns have local protections for renters in rooming houses. Rooming House

2y ago

356 Views

Americans rent, buy, sell and think about home.

median rent among Generation X is 1,062 per month. The youngest renters, Generation Z, are typically paying the least at 882 per month.9 This echoes the notion that Generation Z renters are opting to rent the smallest apartments or homes, which translates to lower monthly rental payments. Approximately half of renters (47 percent) are paying for

1y ago

174 Views

Disaster assistance process overview

A guide through the post-disaster recovery process. KEY ASSISTANCE SOURCES TIPS HOMEOWNERS/RENTERS INSURANCE If you have homeowners or renters insurance, this provides you funds to repair or replace property damaged as a result of covered perils during a disaster. Additional types of insurance, such as auto or other peril-specific

1y ago

109 Views

Personal insurance - Car & Business insurance King Price Insurance

The king's insurance options 5 Things you need to know 7 The stuff you need to do 14 How to claim 16 Our commitment to you 20 Car insurance 22 Car warranty 37 Shortfall cover 45 Scratch and dent 46 Tyre and rim 48 Motorbike insurance 53 Trailer and caravan insurance 64 Watercraft insurance 68 Home contents insurance 77 Buildings insurance 89

1y ago

673 Views

Texas Demographic Trends and Projections and the 2020 Census

Income disparities place African Americans and Latinos at greater risk during times of income loss. Renters, renters w/low incomes, Blacks, and households w/children face greater risk of eviction. Persistently low health insurance coverage in the state increases vulnerability of Texans with employer based insurance.

1y ago

137 Views

Gold Tier - MAPFRE Insurance

Foy Insurance of MA, LLC 198 Frank Consolati Insurance Agency, Inc. 198 County Insurance Agency, Inc. 198 Woodrow W Cross Agency 214 Woodland Insurance Agency, Inc. 214 Tegeler Insurance Services of CT, Inc. 214 Pantano/VonKahle Insurance Agency, Inc. 214 . Hanson Insurance Agency, Inc. 287 J.H. Slattery Insurance Agency, Inc. 287

1y ago

565 Views

Texas - milestonepnc

State Auto - Homeowners TEXAS 05/2017 State Auto Insurance Company UG-1.0 I - UNDERWRITING GUIDELINES A. Entire State Eligibility Guidelines Premier Protection Plus Standard Available Forms HO0004 - Renters HO0005 - Homeowner Expanded HO0006 - Condominium HO0003 - Homeowner HO0004 - Renters HO0005 - Homeowner Expanded

1y ago

112 Views

Consumer Guide to Auto Insurance - csimt.gov

consumer guide to auto insurance contents introduction to auto insurance 1 understanding your auto insurance policy 2 required auto insurance 3 optional types of auto insurance 4-5 getting the right coverage 6 accidents and violations 7 how to shop for auto insurance 8 shopping tips 9 frequently asked questions 10-11 insurance complaints/when you have a problem 12

2y ago

805 Views

Industry Observations Insurance Industry

Jun 30, 2019 · 6/17/2019 Commercial Insurance Branch of Extraco Banks, N.A. Higginbotham Insurance Group, Inc. Insurance Brokers NA 6/13/2019 Links Insurance Services, LLC World Insurance Associates LLC Property and Casualty Insurance NA 6/13/2019 Abram Interstate Insurance Services, Inc. Risk Placement Services,

2y ago

619 Views

Life Insurance Buyer's Guide Life Insurance - National Association of .

Life Insurance uers uide Naional ssociaion of Insurance Commissioners Compare the Different Types of Insurance Policies There are many types of life insurance pol-icies. You should choose a policy with fea-tures that fit your individual needs. Some things to consider are: Term Insurance vs. Cash Value In-surance. Term insurance is intended to

1y ago

520 Views

your guide to understanding auto ins in nh - New Hampshire

Hampshire Insurance Department does not mandate or set Auto Insurance Rates. Auto Insurance Rates will vary by insurance company. This guide is intended to give New Hampshire consumers basic information on auto insurance. It suggests ways to: Lower the cost of your auto insurance, shop for Auto insurance and, file an auto insurance claim.

1y ago

449 Views

18.01.41 - REPLACEMENT OF LIFE INSURANCE AND ANNUITIES - Idaho

Department of Insurance Replacement of Life Insurance and Annuities. Page 3. 04. Existing Life Insurance or Annuity. "Existing Life Insurance or Annuity" means any life insurance or annuity in force, including life insurance under a binding or conditional receipt or a lif e insurance policy or annuity that is within an unconditional refund period.

1y ago

407 Views

EXAMINATION REPORT OF THE ADMIRAL INSURANCE COMPANY AS OF . - Delaware

Berkley Regional Specialty Insurance Comp 31295 DE Carolina Casualty Insurance Company 10510 IA Clermont Insurance Company 33480 IA Continental Western Insurance Company 10804 IA Firemen's Insurance Com pany of Wash, D.C. 21784 DE Gemini Insurance Company 10833 DE Great Divide Insurance Company 25224 ND

1y ago

258 Views

American International Group, Inc. - Federal Reserve

American General Life Insurance Company AGL U.S. Life Insurance Company AGC Life Insurance Company AGC Life U.S. Life Insurance Company The United States Life Insurance Company in the City of New York U.S. Life U.S. Life Insurance Company The Variable Annuity Life Insurance Company VALIC U.S. Life Insurance Company

1y ago

269 Views

Japan's Insurance Market - Toa Re

with 61.6% of net premiums written, of which automobile insurance totaled 48.8% and compulsory automobile liability insurance totaled 12.8%. Fire insurance accounted for 13.7%, miscellaneous casualty insurance including liability insurance accounted for 11.6%, accident insurance accounted for 9.8%, and marine insurance accounted for 3.2%.

1y ago

179 Views

Predicting COVID-19 In Chest X-Ray Images - Stanford University

It looks like you're using an ad-blocker