# Biological Interpretation

## What is the difference between GSEA and Gene Set Enrichment? <a href="#what-is-the-difference-between-gsea-and-gene-set-enrichment" id="what-is-the-difference-between-gsea-and-gene-set-enrichment"></a>

In Partek Flow, GSEA should be performed on a sample/cell and feature matrix data node (e.g. normalized count data). It is used to detect gene sets or pathways that differ significantly between two groups. Gene set enrichment should instead be performed on a filtered gene list, identified by gene name; it uses Fisher's exact test to identify gene sets or pathways that are overrepresented in that list.

## What is the enrichment score shown in the Gene Set Enrichment report? <a href="#what-is-the-enrichment-score-shown-in-the-gene-set-enrichment-report" id="what-is-the-enrichment-score-shown-in-the-gene-set-enrichment-report"></a>

The enrichment score shown in the enrichment report is the negative natural log of the enrichment p-value derived from Fisher's exact test. The higher the enrichment score, the more overrepresented the genes in your list are in the gene set of a GO/pathway category.
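
As a rough illustration of that relationship, the sketch below computes a one-sided (overrepresentation) Fisher's exact p-value from the hypergeometric distribution and converts it to an enrichment score. The function names, the choice of a one-sided test, and the example counts are assumptions made for illustration; this is not Partek Flow's implementation.

```python
from math import comb, log

def overrepresentation_p_value(overlap, list_size, set_size, background):
    """One-sided Fisher's exact p-value: P(X >= overlap) under the hypergeometric model.

    overlap    -- genes in the filtered list that are also in the gene set
    list_size  -- genes in the filtered list
    set_size   -- genes in the gene set
    background -- all genes considered
    (All parameter names are hypothetical.)
    """
    total = comb(background, list_size)
    upper = min(list_size, set_size)
    tail = sum(
        comb(set_size, i) * comb(background - set_size, list_size - i)
        for i in range(overlap, upper + 1)
    )
    return tail / total

def enrichment_score(p_value):
    """Negative natural log of the enrichment p-value."""
    return -log(p_value)

# Hypothetical counts: 3 of 4 listed genes fall in a 5-gene set, 20 genes in total.
p = overrepresentation_p_value(overlap=3, list_size=4, set_size=5, background=20)
print(p, enrichment_score(p))
```

Because the score is a negative log, a smaller p-value gives a larger score, and a p-value of 1 gives a score of 0.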

## In a KEGG pathway, genes can be colored by fold change, p-value, etc. How are the gene statistics calculated? <a href="#in-kegg-pathway-genes-can-be-colored-by-fold-change-and-p-value-etc-how-are-the-gene-statistics-calc" id="in-kegg-pathway-genes-can-be-colored-by-fold-change-and-p-value-etc-how-are-the-gene-statistics-calc"></a>

For Gene set enrichment analysis, only genes from the input data node (filtered gene list) will be colored in the KEGG pathway gene network, using the statistics in the data node.

During GSEA (or Gene set ANOVA) computation, we also perform ANOVA on each gene based on the selected attribute, independently of the GSEA computation (which is at the gene set level). The ANOVA results are only used to color the genes in the KEGG gene network. If GSEA is computed using another database, e.g. GO, we don't compute ANOVA on each gene, since the GO database doesn't have gene network information.

## When should I use GSEA or Gene set ANOVA? <a href="#when-should-i-use-gsea-or-gene-set-anova" id="when-should-i-use-gsea-or-gene-set-anova"></a>

Both methods should be performed on a normalized matrix data node, and both require gene symbols in the feature annotation. Both detect differentially expressed gene sets (pathways) rather than individual genes, but the algorithms are different. GSEA is a popular method from the [Broad Institute](https://www.gsea-msigdb.org/gsea/index.jsp). Gene Set ANOVA is based on a generalized linear model; see the [Gene Set ANOVA white paper](https://help.partek.illumina.com/partek-flow/white-papers/gene-set-anova) for details.

