Analyzing the unexplained regions spreadsheet
Last updated
Last updated
During a previous section of this tutorial, a spreadsheet named unexplained_regions was generated. This spreadsheet contains locations where reads map to the genome but are not annotated by the transcript database, in this case, RefSeqGene. The unexplained_regions spreadsheet is potentially very interesting as it may contain novel findings.
Right click column 6. Average Coverage and select Sort Descending from the menu
Select Find Overlapping Genes from the Tools option in the command toolbar (Figure 1)
Figure 1. Selecting Find Overlapping Genes from Tools in the command toolbar
Select Add a new column with the gene nearest to the region in the Find Overlapping Genes dialog (Figure 2)
Select OK
Figure 2. Find Overlapping Genes
Select RefSeq****Transcripts – 2017-05-02 from the Output Overlapping Features dialog (Figure 3)
Please note that it is recommended that you annotate with the same database used when you performed mRNA quantification.
Select OK
Figure 3. Select the database to search for overlapping features
The closest overlapping feature and the distance to it is now included as columns 7. Overlapping Features and _8. Nearest Features i_n the unexplained_regions spreadsheet.
Right-clicking on a row header and selecting Browse to Location will show the reads mapped to the chromosome. For this tutorial, a couple of genes are selected to show regions that are located after a known gene or in the intron of a gene.
Right-click row 39 and select Browse to location from the pop-up menu
Select the Chromosome View tab to view a region within an intron of UNC45B. This may be a novel exon (Figure 4)
Figure 4. A region within an intron of UNC45B that might be an novel exon
Right-click row 12576 and select Browse to location to go to a region that starts 1 bp after CD82.
This peak may represent an extended exon (Figure 5).
Figure 5. A region that starts 1 bp after CD82 that might represent an extended exon
While RefSeq was used to identify overlapping features, the choice of which database to use will depend on the biological context of your experiment. For example, you may wish to utilize promoter or miRNA databases if you are interested in regulation of expression.
If you need additional assistance, please visit our support page to submit a help ticket or find phone numbers for regional support.
Select () several times to zoom out slightly