# Importing a GEO / ENA project

* [How to import a study from GEO / ENA](#how-to-import-a-study-from-geo--ena)
* [Common Issues](#common-issues)
  * [Error Message - The project did not yield any data. Double-check the project ID, or try importing the data manually](#error-message---the-project-did-not-yield-any-data-double-check-the-project-id-or-try-importing-the-data-manually)
  * [The project was imported, but the Analyses tab is empty and there are no FASTQ files](#the-project-was-imported-but-the-analyses-tab-is-empty-and-there-are-no-fastq-files)
  * [Something is missing or the import failed](#something-is-missing-or-the-import-failed)
* [FAQ](#faq)
  * [What are GEO and ENA?](#what-are-geo-and-ena)
  * [How do I know if a GEO project is also in ENA?](#how-do-i-know-if-a-geo-project-is-also-in-ena)

## How to import a study from GEO / ENA

If a project is publicly available in both the Gene Expression Omnibus (GEO) and the European Nucleotide Archive (ENA), you can automatically import its FASTQ files, sample attributes, and project details into Partek Flow.

* Click **Projects** at the top of the page
* Click **Import project**

<figure><img src="https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-e9f2fb625c099802bbff7854086cedff88feaac8%2Fimporting-project-invoke.png?alt=media" alt=""><figcaption><p><em>Figure 1. Invoking project import</em></p></figcaption></figure>

* Choose **GEO / ENA project** under *Select files from*
* Enter the BioProject ID or the GEO accession number

<figure><img src="https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-16defef1631af3101c9cce23591cea11b28ab645%2Fenter-the-bioproject-id-in-the-import-project-dialog.png?alt=media" alt=""><figcaption><p><em>Figure 2. Enter the BioProject ID in the Import project dialog</em></p></figcaption></figure>

A BioProject ID has the format PRJNA followed by one to six digits (e.g., PRJNA291540). A GEO accession number has the format GSE followed by one to five digits (e.g., GSE71578).
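If you want to check an ID before pasting it into the dialog, the formats described above can be expressed as regular expressions. The following is a minimal sketch: the function name `classify_accession` is hypothetical, and the patterns only mirror the formats stated on this page. They are not Partek Flow's own validation logic, which may accept other forms.

```python
import re
from typing import Optional

# Patterns mirror the formats stated on this page (an assumption about
# what the import dialog accepts, not Partek Flow's actual validation).
BIOPROJECT_RE = re.compile(r"PRJNA\d{1,6}")
GEO_SERIES_RE = re.compile(r"GSE\d{1,5}")


def classify_accession(text: str) -> Optional[str]:
    """Return 'bioproject', 'geo', or None if the ID matches neither format."""
    # Tolerate surrounding whitespace and lowercase input.
    candidate = text.strip().upper()
    if BIOPROJECT_RE.fullmatch(candidate):
        return "bioproject"
    if GEO_SERIES_RE.fullmatch(candidate):
        return "geo"
    return None
```

For example, `classify_accession("PRJNA291540")` returns `"bioproject"` and `classify_accession("GSE71578")` returns `"geo"`, while an ID with a different prefix returns `None`.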

* Click **Import project** at the bottom

Once the data download has started, an Unaligned reads data node appears in the **Analyses** tab (Figure 3). Depending on the size of the data, the download may take a while to complete. FASTQ files are downloaded from the ENA BioProject page.

<figure><img src="https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-324fcd11cc51dbc11c7ca4a8cde345608434c17e%2Ffastq-files-will-be-added-as-an-unaligned-reads-data-node-in-the-analyses-tab.png?alt=media" alt=""><figcaption><p><em>Figure 3. FASTQ files will be added as an Unaligned reads data node in the Analyses tab</em></p></figcaption></figure>

## Common Issues

### Error Message - The project did not yield any data. Double-check the project ID, or try importing the data manually

Project import succeeds only if the study is publicly available in both GEO and ENA. If it is missing from either database, the import will not succeed.

### The project was imported, but the Analyses tab is empty and there are no FASTQ files

If there is an ENA project, but the FASTQ files are not available through ENA, the project will be created, but data will not be imported.
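One way to check in advance whether ENA lists FASTQ files for a project is to query the ENA Portal API file report. The sketch below is illustrative only: the endpoint and the `run_accession` and `fastq_ftp` field names reflect the ENA Portal API as we understand it and should be confirmed against ENA's documentation, the helper names are hypothetical, and this page does not state that Partek Flow itself uses this endpoint.

```python
import csv
import io
from typing import Dict, List
from urllib.parse import urlencode
from urllib.request import urlopen

# ENA Portal API file report endpoint (verify against ENA's documentation).
ENA_FILEREPORT = "https://www.ebi.ac.uk/ena/portal/api/filereport"


def filereport_url(accession: str) -> str:
    """Build a file report URL listing runs and FASTQ links for a project."""
    query = urlencode({
        "accession": accession,
        "result": "read_run",
        "fields": "run_accession,fastq_ftp",
        "format": "tsv",
    })
    return f"{ENA_FILEREPORT}?{query}"


def runs_with_fastq(tsv_text: str) -> Dict[str, List[str]]:
    """Map each run accession to its FASTQ links (empty list if none)."""
    reader = csv.DictReader(io.StringIO(tsv_text), delimiter="\t")
    runs: Dict[str, List[str]] = {}
    for row in reader:
        # Multiple files for one run are separated by semicolons.
        links = [u for u in (row.get("fastq_ftp") or "").split(";") if u]
        runs[row["run_accession"]] = links
    return runs


def fetch_report(accession: str, timeout: float = 30.0) -> str:
    """Download the TSV report; requires network access."""
    with urlopen(filereport_url(accession), timeout=timeout) as resp:
        return resp.read().decode("utf-8")
```

Calling `runs_with_fastq(fetch_report("PRJNA291540"))` would return one entry per run. Runs that map to an empty list have no FASTQ links in the report, which corresponds to the situation described above.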

### Something is missing or the import failed

A variety of other issues and irregularities can cause an import to fail or to succeed only partially. These include, but are not limited to, a BioProject having multiple associated GSE IDs, incomplete information on the GEO or ENA page, and either the GEO or ENA project not being publicly available.

## FAQ

### What are GEO and ENA?

The Gene Expression Omnibus (GEO) and the European Nucleotide Archive (ENA) are web-accessible public repositories for genomic data and experiments. Access and learn more about their resources at their respective websites:

* GEO - <https://www.ncbi.nlm.nih.gov/geo/>
* ENA - <https://www.ebi.ac.uk/ena>

### How do I know if a GEO project is also in ENA?

You can search ENA using the GEO ID (e.g., GSE71578) to check if there is a matching ENA project (Figures 4 and 5).

<figure><img src="https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-974dbc67a0c6ced43f3b24915827e32564234d27%2Fsearching-ena-using-a-geo-id.png?alt=media" alt=""><figcaption><p><em>Figure 4. Searching ENA using a GEO ID</em></p></figcaption></figure>

<figure><img src="https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-60a89dcc5c323d0feb5334d9b33acde3720336e2%2Fena-study-page.png?alt=media" alt=""><figcaption><p><em>Figure 5. ENA Study page</em></p></figcaption></figure>

## Additional Assistance

If you need additional assistance, please visit [our support page](http://www.partek.com/support) to submit a help ticket or find phone numbers for regional support.
