# Analyze differentially expressed miRNAs

Typically, you would begin a miRNA expression analysis with the same steps outlined in the [Importing Affymetrix CEL files](https://help.partek.illumina.com/partek-genomics-suite/tutorials/gene-expression-analysis/importing-affymetrix-cel-files) section of the *Gene Expression* tutorial. Here, the data has already been imported and attributes added.

To being our analysis, we will open the *miRNA Expression* workflow.

* Select the **miRNA Expression** workflow from the *Workflows* drop-down menu

The *miRNA Expression* workflow provides a series of steps for analyzing miRNA expression data and integrating it with gene expression data (Figure 1).

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-1b4a41c07374c1f24d9506799ab5d425315267ba%2Fimage2017-12-12%2014_23_17.png?alt=media)

Figure 1. The miRNA Expression workflow

## Exploratory data analysis

Principal Components Analysis (PCA) is an excellent method to visualize similarities and differences between the samples in a data set. PCA can be invoked through a workflow, by selecting ![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-f16a0d6bb4c88adcf5204f044f59ce994c5edc98%2Fimage2018-1-8%2016_1_26.png?alt=media) from the main command bar, or by selecting Scatter Plot from the View section of the main toolbar. We will use a workflow.

* Select the **Affy\_miR\_BrainHeart\_intensities** spreadsheet

This is the probe intensities spreadsheet for the miRNA expression data (Figure 2). Each row is a sample; columns 7 to 9 give attribute information about each sample including tissue, replicate number, and scan date, while columns 10 on give prove intensities values.

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-84195da27e467caa300eafeffadd3b6ecab8b473%2Fimage2017-12-12%2014_31_34.png?alt=media)

Figure 2. Viewing the miRNA probe intensities spreadsheet

* Select **PCA Scatter Plot** from the *QA/QC* section of the workflow

A new tab will open showing a PCA scatter plot (Figure 3).

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-0d373b69d21128f27437a3be8c0dc62d0012c136%2Fimage2017-12-12%2014_44_54.png?alt=media)

Figure 3. PCA scatter plot. Samples are spheres. Samples with more similar miRNA expression are close together while dissimilar samples are further apart.

In this PCA scatter plot, each point represents a sample in the spreadsheet. Points that are close together in the plot are more similar, while points that are far apart in the plot are more dissimilar.

To better view the data, we can rotate the plot.

* Select (![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-cd74857bb6d4ef1a197883fd046b1134426b9b7c%2Fimage2017-8-24%2010_51_39.png?alt=media)) to activate *Rotate Mode*
* Click and drag to rotate the plot

Rotating the plot allows us to look for outliers in the data on each of the three principal components (PC1-3). The percentage of the total variation explained by each PC is listed by its axis label. The chart label shows the sum percentage of the total variation explained by the displayed PCs.

Here, we can see that the brain and heart samples are well separated across PC1, which is expected.

For more information about customizing the plot, please see [Exploring the data set with PCA](https://help.partek.illumina.com/partek-genomics-suite/tutorials/gene-expression-analysis-with-batch-effects/exploring-the-data-set-with-pca) from the *Gene Expression with Batch Effect* tutorial.

## Detecting differentially expressed miRNAs

Next, we will identify miRNAs that are differentially expressed between brain and heart tissues.

* Select the **Analysis** tab
* Select the **Affy\_miR\_BrainHeart\_intensities** spreadsheet
* Select **Detect Differentially Expressed miRNAs** from the *Analysis* section of the workflow

The *ANOVA* dialog (Figure 4) allows us to configure the comparisons we want to make between samples and groups within the data set.

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-633d9e1f0b1f4d91342aacece41efc7f977e0647%2Fimage2017-12-12%2014_54_14.png?alt=media)

Figure 4. ANOVA dialog

* Select **Tissue** from the *Experimental Factor(s)* panel
* Select **Add Factor >** to move *Tissue* to the *ANOVA Factor(s)* panel

The *Contrasts...* button will now be available to select.

* Select **Contrasts...**

The *Configure ANOVA* dialog (Figure 5) is used to set up contrasts. Contrasts are the comparisons between groups and are where experimental questions can be asked. In this study, we are asking what miRNAs are differentially expressed between heart and brain tissue.

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-c012deb914c34c1a82cba32da553fe4de4b56384%2Fimage2017-12-12%2015_0_6.png?alt=media)

Figure 5. ANOVA configuration dialog

* Select **Yes** for *Data is already log transformed?*
* Select **Fold change** for *Report comparisons as*
* Select **7. Tissue** from the *Select Factor/Interaction* drop-down menu
* Select **brain** from the left panel
* Select **Add Contrast Level >** to move *brain* to the upper group - initially Group 1
* Select **heart** from the left panel
* Select **Add Contrast Level >** to move *heart* to the lower group - initially Group 2

This contrast (Figure 6) will compare expression of miRNAs in brain samples to expression in heart samples with brain as the numerator and heart as the denominator for fold-change calculations.

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-6ee75e7cf4d94ed922a4385c9b475878d54f7d95%2Fimage2017-12-12%2015_4_26.png?alt=media)

Figure 6. Configuring a contrast between brain and heart tissue in the ANOVA dialog

* Select **Add Contrast**
* Select **OK**

The *Contrasts...* button should now read *Contrasts Included.*

* Select **OK** to run the ANOVA as configured

An ANOVA Results sheet, *ANOVAResults*, will be created as a child spreadsheet of *Affy\_miR\_BrainHeart\_intensities* (Figure 7). In this spreadsheet, each row represents a probe set and the columns represent the computation results for that probe set. Although not synonymous, probe set and gene will be treated as synonyms in this tutorial for convenience. By default, the genes are sorted in ascending order by the p-value of the first categorical factor, which, in this case, is *Tissue*. This means the most significant differentially expressed miRNAs between the brain and heart (up-regulated and donw-regulated) are at the top of the spreadsheet.

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-4542157fae00b3d96464f9791d5edf502c287132%2Fimage2017-12-12%2015_25_46.png?alt=media)

Figure 7. Viewing the ANOVA results spreadsheet

You may explore what is known about any listed miRNA using external databases TargetScan, miRBase, microRNA.org, or miR2Disease, by right-clicking a row header, selecting *Find miRNA in...* and choosing one of the external databases. This will open a web page in your default web browser and requires your computer be connected to the internet.

For more information about AVOVA in Partek Genomics Suite, see [Identifying differentially expressed genes using ANOVA](https://help.partek.illumina.com/partek-genomics-suite/tutorials/gene-expression-analysis/identifying-differentially-expressed-genes-using-anova).

## Creating a list of miRNAs of interest

The ANOVA results spreadsheet includes every miRNA on the array for a total of 7815 miRNAs. However, many of these miRNAs are not significantly differentially expressed between brain and heart and, thus, are not of interest. Next, we will create a filtered list of significantly differentially expressed miRNAs.

* Select the **ANOVAResults** spreadsheet
* Select **Create List** from the *Analysis* section of the workflow

The *List Manager* dialog will open (Figure 8).

* Select **brain vs. heart** under *Contrast: find genes that change between two categories*

By default, the fold-change and significance thresholds are set to > 2, < -2 and p-value with FDR < 0.05. These defaults are appropriate for this tutorial so we will leave them in place.

* Select **Create** to create a new list, *brain vs. heart* containing only the 1404 miRNAs that pass the criteria

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-cc308ababde9d47e80d0d547754659c00dd71377%2Fimage2017-12-12%2015_34_48.png?alt=media)

Figure 8. Creating a list of significantly differentially expressed miRNAs

A new spreadsheet, *brain vs. heart* will be created as a child spreadsheet of *Affy\_miR\_BrainHeart* (Figure 9).

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-2a40d4d43914e733cb5b616b9206ad4578bde8c5%2Fimage2017-12-15%2013_45_38.png?alt=media)

Figure 9. Viewing brain vs. heart spreadsheet

To view the miRNAs with the largest difference between tissues, we can sort by fold-change.

* Right-click the *6. Fold-Change(brain vs. heart)* column header
* Select **Sort Descending by Absolute Value** from the pop-up menu

The top 33 miRNAs we see (Figure 10) are all miR-124 from different species. The miRNA miR-124 is the most abundant miRNA in neuronal cells so this finding is expected. The multiple species versions of miR-124 are present because Affymetrix GeneChip miRNA arrays provide comprehensive coverage of miRNAs from multiple organisms including human, mouse, rat, canine, monkey, and many more on a single chip. The miRNAs from these different species are highly homologous so probes targeting miRNAs from other species will hybridize with human miRNAs. Therefore, we need to filter the list of miRNAs to include only human miRNAs.

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-d7d1fb8d3315d17f55ae09c7907daede691a9daa%2Fimage2017-12-15%2013_55_41.png?alt=media)

Figure 10. miR-124 is highly differentially expressed in brain vs. heart

To do this, we need to add a new annotation column containing species information for each probe.

* Right-click on the *2. Probeset ID* column header
* Select **Insert Annotation** from the pop-up menu
* Select **Add as categorical**
* Check **Species Scientific Name** (Figure 11)
* Select **OK** to add the annotation column

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-fc02273a17e4908b94bbfa6a9da85889b63a7918%2Fimage2017-12-15%2014_4_19.png?alt=media)

Figure 11. Inserting species annotation column

The table now includes a column *3. Species Scientific Name* with the species name of each miRNA. We can now filter to include only human miRNAs.

* Right-click the *3. Species Scientific Name* column header
* Select **Find / Replace / Select...** from the pop-up menu
* Type **Homo sapiens** for *Find What*
* Select **Only in column** for *Search*
* Select **3. Species Scientific Name** from the drop-down menu next to the *Only in column* option
* Select **Select All** (Figure 12)

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-d5933f07d756e205e2b64d955d4ec7dc35e67d8d%2Fimage2017-12-15%2014_12_11.png?alt=media)

Figure 12. Configuring the Find / / Replace / Select... dialog

The search should find and select 251 miRNAs.

* Select **Close**
* Right-click any of the row headers that are selected
* Select **Filter Include** from the pop-up menu

The spreadsheet will now include only the 251 miRNAs from human (Figure 13). The first row is still miR-124 with a fold change of 4087.94. The black and gold bar on the right-hand side of the spreadsheet indicates the fraction of rows that have been filtered. To retain this filtered list, we can create a new spreadsheet.

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-3150f6b850dc2f5bb8291094c84be649bcf64ed7%2Fimage2017-12-15%2014_16_30.png?alt=media)

Figure 13. Viewing differentially expressed human miRNAs

* Right-click the *brain\_vs\_heart* spreadsheet in the spreadsheet tree
* Select **Clone...** from the pop-up menu

Cloning a spreadsheet while a filter is applied copies only the included rows/columns.

* Name the spreadsheet **brain\_vs\_heart\_human**
* Select **Affy\_miR\_BrainHeart\_intensities** from the drop-down menu *Create new spreadsheet as a child of spreadsheet*
* Select ![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-745449868a6951e5cbd330d115bbe94495c63e37%2Fimage2017-9-26%2013_31_19.png?alt=media)
* Name the new file **brain vs. heart human**
* Select **Save**

The new spreadsheet includes only the 251 human miRNAs that are significantly differentially expressed between brain and heart tissue (Figure 14).

![](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-be1ed37a6bb600fc4160fca1ea37553dfc54d21e%2Fimage2017-12-15%2014_27_3.png?alt=media)

Figure 14. Viewing the filtered human miRNAs spreadsheet

The next step in our analysis will be integrating miRNA and gene expression data.

## Additional Assistance

If you need additional assistance, please visit [our support page](http://www.partek.com/support) to submit a help ticket or find phone numbers for regional support.
