How does data augmentation alter classification performance of support vector machines and convoluted neural network models developed using transmission low frequency Raman spectra?

Sara Miller; Mitchell C. Chalmers; Brendan McCane; Keith Gordon

Back

How does data augmentation alter classification performance of support vector machines and convoluted neural network models developed using transmission low frequency Raman spectra?

Conference proceeding

Open access

How does data augmentation alter classification performance of support vector machines and convoluted neural network models developed using transmission low frequency Raman spectra?

Sara Miller, Mitchell C. Chalmers, Brendan McCane and Keith Gordon

Proceedings of the Te Whai Ao Dodd-Walls Centre (DWC) Symposium

Te Whai Ao - Dodd-Walls Centre Symposium 2024 (Christchurch, New Zealand, 18/11/2024–22/11/2024)

11/2024

Handle:

https://hdl.handle.net/10523/47473

Abstract

Deep Raman spectroscopic techniques have been highlighted as a potential avenue for disease diagnosis and characterizing disease states. Transmission Raman spectroscopy has been explored in the literature for applications such as breast cancer detection as a non-invasive, non-ionising and chemically specific characterization method to target detection of breast microcalcification composition. However the sensitivity has not yet reached levels for uptake in clinic. The use of the low wavenumber analogue to transmission Raman (transmission low frequency Raman) provides additional information on the order of solid microcalcifications which is proposed to add information to increase the sensitivity of the approach. In addition, the multivariate classification methods (e.g. convolutional neural networks and support vector machines) used for diagnosis need to be further optimised. The stability of two machine learning techniques were probed by intentionally introducing spectral artefacts to the transmission low frequency Raman spectroscopic data collected from calcifications (calcium oxalate, crystalline, intermediate and amorphous hydroxyapatite) buried in chicken breast. SVM yielded a slightly better model with an AUC of 0.989 compared to 0.979 for the CNN. However, in general SVM were found to be more susceptible to spectral artefacts than CNN. Additionally, the performance of the CNNs and SVMs was not dependent on the magnitude of the shifts and stretches in the augmented data. An example is the linear- stretching of the data where the AUC remained at 0.977 and 0.969 for both 2 cm-1 and 5 cm-1 shifts for CNN and SVM, respectively.

Files and links (1)

url

Link to Book of AbstractsView

Metrics

1 Record Views

Details

Record Identifier: 9926760642401891
Title: How does data augmentation alter classification performance of support vector machines and convoluted neural network models developed using transmission low frequency Raman spectra?
Creators: Sara Miller
Mitchell C. Chalmers
Brendan McCane
Keith Gordon
Publication Details: Proceedings of the Te Whai Ao Dodd-Walls Centre (DWC) Symposium
Conference: Te Whai Ao - Dodd-Walls Centre Symposium 2024 (Christchurch, New Zealand, 18/11/2024–22/11/2024)
Academic Unit: Dodd-Walls Centre for Photonic and Quantum Tech; Chemistry; School of Computing
Publisher: Dodd-Walls Centre
Date published ; e-published: 11/2024
Comment: The published version is not available in full-text in OUR Archive. Where available, a link to the published version is provided (check the DOI and/or the Files and links section). The full-text item may be open access on the publisher's website. An earlier version of the work (such as authors' accepted manuscript following peer-review or unreviewed preprint/author's original version) may be available in the Files and links section of this record. Alternatively, readers may have subscription access to the full-text from the publisher.
Language: English
Resource Type ; Subtype: Conference proceeding; Conference Abstract