Graduation Year

2018

Document Type

Thesis

Degree

M.S.C.S.

Degree Name

MS in Computer Science (M.S.C.S.)

Degree Granting Department

Computer Science and Engineering

Major Professor

Lawrence Hall, Ph.D.

Committee Member

Dmitry Goldgof, Ph.D.

Committee Member

Sudeep Sarkar, Ph.D.

Keywords

Classification, Computer-Aided Diagnosis, Lung Cancer, Prognosis, Radiomics

Abstract

Computed tomography (CT) imagery is an important weapon in the fight against lung cancer; various forms of lung cancer are routinely diagnosed from CT imagery. The growth of the suspect nodule is known to be a prognostic factor in the diagnosis of pulmonary cancer, but the change in other aspects of the nodule, such as its aspect ratio, density, spiculation, or other features usable for machine learning, may also provide prognostic information.

We hypothesized that adding combined feature information from multiple CT image sets separated in time could provide a more accurate determination of nodule malignancy. To this end, we combined data from multiple CT images for individual patients taken from the National Lung Screening Trial. The resulting dataset was compared to equivalent datasets featuring single CT images for each patient. Feature reduction and normalization was performed as is standard.

The highest accuracy achieved was 83.71% on a subset of features chosen by a combination of manual feature stability testing and the Correlation-based Feature Selection algorithm and classified by the Random Forests algorithm. The highest accuracy achieved with individual CT images was 81.00%, on a feature set consisting solely of the volume of the nodule in cubic centimeters.

Share

COinS