# Adding an Annotation Model

On the [library file management page](https://help.partek.illumina.com/partek-flow/user-manual/settings/components/library-file-management/library-file-management-page), click the **+** **Add annotation models** button in the *Annotation models* section, choose **Gene/feature annotation** from the *Library type* drop-down list in the dialog (Figure 1).

If you are using an assembly supported by Partek (e.g. human), annotation models from a variety of commonly used sources (e.g. ENSEMBL, GENCODE) will appear in the *Annotation model* drop-down list in the dialog. Choose an annotation model, select the **Download annotation file** radio button and click **Create** (Figure 1).

![Figure 1. For many model organisms, automatic downloads of gene/feature annotation models from various sources are available from the Partek repository.](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-6ab32748f68d4269d9be995cb80d62c9cd04fc86%2Fadd-annotation-model-dialog.png?alt=media)

If you prefer to add a custom annotation file or if you are working with a custom assembly, choose **Add annotation model** from the *Annotation model* drop-down list (Figure 2). You may need to scroll to the bottom of the drop-down list to see this option. Name the annotation model by typing into the *Custom name* box and click **Create**. Characters such as $ \* | \ : " < > ? / % cannot be used in custom names.

![Figure 2. Add a custom annotation model](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-82b777068c4aea4a14d069f595e1b9558751bbe8%2Fadd-custome-anno-model.png?alt=media)

A custom annotation model can be added from the *Partek Flow Server* or a *URL* download link. The behavior of each option is similar to when importing a reference sequence (see [*Adding a Reference Sequence*](https://help.partek.illumina.com/partek-flow/user-manual/settings/components/library-file-management/adding-a-reference-sequence)). When browsing for files on the Partek Flow server, only the files with relevant file extensions will be visible (.gtf, .gff, .gff3, .bed, .pannot, .txt and various compressed formats).

Select the source of annotation file and click *Next.* If the annotation file format is gtf, gff, gff3 or bed, a preview of the first 10 rows of the file will be shown on the screen (Figure 3). You must then specify the type of annotation file by choosing an option from the *Annotation type* drop-down list (Figure 3). Choose *mRNA* for whole transcriptome annotations, where both gene and transcript level information are present. Choose *microRNA* for precursor or mature microRNA transcript annotations. Choose *Amplicon* for targeted amplicon sequencing annotations. Choose *Other* for annotation files that do not fall into the other four categories (e.g. lncRNA).

![Figure 3. When importing an annotation model, the data type must be specified](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-06f1f65288f77e4ad1c562537fb3984f2adf5451%2Fpreview-anno-model.png?alt=media)

For text annotations with both gene- and transcript-level information select *mRNA* from the drop-down menu. You can manually specify which column corresponds to the **Transcript ID** or **Gene ID** by selecting the corresponding radio button next to the column with that information (Figure 4).

![Figure 4. Select the column with gene- and transcript level information](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-a4d21e9f30df4077288265c6eb5ee9bf3c9c32bb%2Fselect-geneID-transcriptID.png?alt=media)

Genome coordinates for annotation models stored in Partek Flow are 1-based, start-inclusive, and stop-exclusive. This means that the first base position starts from one, the start coordinate for a feature is included in the feature and the stop/end coordinate is not included in the feature. These are the genome coordinates that are printed in various task reports and output files when an annotation model is involved in the task. When custom annotation files are added to Partek Flow, the genome coordinates are converted into this format. The coordinates are converted back if necessary for a specific task. Figure 5 shows how the genome coordinates vary between different annotation formats.

![Figure 5. Comparison of genome coordinate formats. The example column shows the coordinates for a feature that starts at position 1000 (inclusive) and stops at position 2000 (inclusive).](https://1384254481-files.gitbook.io/~/files/v0/b/gitbook-x-prod.appspot.com/o/spaces%2FJVEESmJAPppJ3ijFq5aR%2Fuploads%2Fgit-blob-62ec1d304bd259c3e91b7d46041b3ed3ed9abbda%2Fare-genome-coordinates-1-based-or-0-based.png?alt=media)

## Additional Assistance

If you need additional assistance, please visit [our support page](http://www.partek.com/support) to submit a help ticket or find phone numbers for regional support.
