SampleExplorer: using language models to discover relevant transcriptome data

Citation:
Chin WL, Lassmann T. SampleExplorer: using language models to discover relevant transcriptome data. Bioinformatics. 2024;41(1).

Keywords:
Computational biology; methods; gene expression profiling; transcriptome; genetics

Abstract:
Over the last two decades, transcriptomics has become a standard technique in biomedical research. We now have large databases of RNA-seq data, accompanied by valuable metadata detailing scientific objectives and the experimental procedures used. The metadata is crucial in understanding and replicating published studies, but so far has been underutilized in helping researchers to discover existing datasets.