Hello!
This week, I did more work in the UCSC Genome Browser. This time I looked at the
gene that encodes my protein, which you will all recognize: CHIA!
GTEx Results
Description of Data: Homo sapiens chitinase acidic (CHIA), transcript variant 4, mRNA. (from RefSeq NM_201653)
Methods: Tissue samples were obtained using the GTEx standard operating procedures for informed consent and tissue collection, in conjunction with the National Cancer Institute Biorepositories and Biospecimen. All tissue specimens were reviewed by pathologists to characterize and verify organ source. Images from stained tissue samples can be viewed via the NCI histopathology viewer. The Qiagen PAXgene non-formalin tissue preservation product was used to stabilize tissue specimens without cross-linking biomolecules.
RNA-seq was performed by the GTEx Laboratory, Data Analysis and Coordinating Center (LDACC) at the Broad Institute. The Illumina TruSeq protocol was used to create an unstranded polyA+ library sequenced on the Illumina HiSeq 2000 and HiSeq 2500 platforms to produce 76-bp paired end reads with a coverage goal of 50M (median achieved was ~82M total reads).
Sequence reads were aligned to the hg38/GRCh38 human genome using STAR v2.5.3a assisted by the GENCODE 26 transcriptome definition. The alignment pipeline is available here.
Gene annotations were produced using a custom isoform collapsing procedure that excluded retained intron and read through transcripts, merged overlapping exon intervals and then excluded exon intervals overlapping between genes. Gene expression levels in TPM were called via the RNA-SeQC tool (v1.1.9), after filtering for unique mapping, proper pairing, and exon overlap. For further method details, see the GTEx Portal Documentation page.
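To make the TPM unit a bit more concrete, here is a minimal sketch of the general TPM (transcripts per million) calculation. This is the textbook definition, not the RNA-SeQC implementation, and the gene names, read counts, and lengths are made up for illustration.

```python
def tpm(read_counts, gene_lengths_bp):
    """Convert raw read counts to TPM (transcripts per million).

    Both arguments are dicts keyed by gene name. The names and numbers
    used below are hypothetical, not GTEx data.
    """
    # Step 1: reads per kilobase, so long genes are not favored.
    rpk = {g: read_counts[g] / (gene_lengths_bp[g] / 1000) for g in read_counts}
    # Step 2: scale so that all values in the sample sum to one million.
    scale = sum(rpk.values()) / 1_000_000
    return {g: value / scale for g, value in rpk.items()}


# Made-up example sample with three genes.
counts = {"geneA": 100, "geneB": 300, "geneC": 600}
lengths = {"geneA": 1000, "geneB": 3000, "geneC": 2000}
tpm_values = tpm(counts, lengths)
```

Because every sample is scaled to the same total, TPM values can be compared across samples more easily than raw read counts.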
UCSC obtained the gene-level expression files, gene annotations and sample metadata from the GTEx Portal Download page. Median expression level in TPM was computed per gene/per tissue.
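The "median per gene/per tissue" step can be sketched in a few lines of Python. This is only my illustration of the idea, not UCSC's actual code; the function name and the TPM values are invented and are not real GTEx measurements.

```python
from collections import defaultdict
from statistics import median


def median_tpm_by_tissue(samples):
    """Return the median TPM for each (gene, tissue) pair.

    `samples` is a list of (gene, tissue, tpm) tuples, one per sample.
    """
    grouped = defaultdict(list)
    for gene, tissue, tpm in samples:
        grouped[(gene, tissue)].append(tpm)
    return {key: median(values) for key, values in grouped.items()}


# Invented TPM values, chosen only to show the grouping.
samples = [
    ("CHIA", "Stomach", 120.0),
    ("CHIA", "Stomach", 80.0),
    ("CHIA", "Stomach", 100.0),
    ("CHIA", "Lung", 2.0),
    ("CHIA", "Lung", 4.0),
]
medians = median_tpm_by_tissue(samples)
```

Using the median rather than the mean keeps one unusual sample from dominating the value shown for a tissue.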
These are the GTEx results. This image shows where the CHIA gene is expressed. As you can see, it is highly expressed in the stomach. If you look closely, you can also see that it is expressed at much lower levels in Lung, Esophagus - Mucosa, Esophagus - Gastroesophageal Junction, and Adipose - Visceral.
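If you wanted to search the transcriptome for an unknown sequence, would you use microarray technology or RNA sequencing?
You would use RNA sequencing, because it does not rely on probes designed from already-known sequences.
What is the major difference between microarray technology and RNA sequencing?
Microarrays can only detect transcripts for which probes were designed in advance, whereas RNA sequencing reads whatever RNA is in the sample, so it can cover the whole transcriptome, including previously unknown sequences.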
UCSC Genome Browser: Kent WJ, Sugnet CW, Furey TS, Roskin KM, Pringle TH, Zahler AM, Haussler D. The human genome browser at UCSC. Genome Res. 2002 Jun;12(6):996-1006. https://genome.cshlp.org/content/12/6/996.abstract
Track Data Hubs: Raney BJ, Dreszer TR, Barber GP, Clawson H, Fujita PA, Wang T, Nguyen N, Paten B, Zweig AS, Karolchik D, Kent WJ. Track data hubs enable visualization of user-defined genome-wide annotations on the UCSC Genome Browser. Bioinformatics. 2014 Apr 1;30(7):1003-1005. https://doi.org/10.1093/bioinformatics/btt637


