Liquid biopsies offer a less invasive alternative to traditional tissue biopsies for disease diagnosis by detecting biomarkers in body fluids. This project aims to use differential expression analysis to identify predictive genes and train machine learning models, en- hancing early diagnosis models for preeclampsia and liver cancer using cell-free RNA data present in plasma. Comparing to baseline models from previous studies, our results reveal a significant improvement in early liver cancer prediction, but not in preeclampsia. This underscores the potential of liquid biopsies in identifying biomarkers but also highlights the need for further research to address current limitations.
This is the supporting website for the data analyses of the master’s thesis project entitled “Biomarker discovery in cfRNA data” at the Master Program of Data Science Methodology from the Barcelona School of Economics, in the academic year 2023-24. The analyses presented here are based on gene expression data stored in tables of counts obtained from RNA sequencing (RNA-seq) experiments conducted in the following publications:
The tables of counts were obtained by processing the raw FASTQ files deposited by the authors of the previous publications at the Gene Expression Omnibus (GEO) repository. The processing of these FASTQ files was performed by Beatriz Calvo-Serra and Robert Castelo.