# Cell Ranger - ATAC

* [What is Cell Ranger?](#what-is-cell-ranger)
* [Cell Ranger - ATAC in Partek Flow](#cell-ranger---atac-in-partek-flow)
* [Importing Cell Ranger - ATAC data in a Partek Flow project](#importing-cell-ranger---atac-data-in-a-partek-flow-project)
* [Running Cell Ranger - ATAC in Flow](#running-cell-ranger---atac-in-flow)
* [Cell Ranger - ATAC task report in Flow](#cell-ranger---atac-task-report-in-flow)
* [Import Cell Ranger output into Flow](#import-cell-ranger-output-into-flow)
* [Import FASTQs for Cell Ranger - ATAC in Flow](#import-fastqs-for-cell-ranger---atac-in-flow)
* [References](#references)

***

## What is Cell Ranger?

Cell Ranger is a set of analysis pipelines that process Chromium single cell data to align reads, generate feature-barcode matrices, and perform clustering and gene expression analysis for 10x Genomics Chromium technology\[1].

## Cell Ranger - ATAC in Partek Flow

The **Cell Ranger - ATAC** task in Partek Flow includes two different wrappers. For single cell ATAC-Seq datasets, the '*cellranger-atac count*' pipeline from Cell Ranger ATAC v2.0\[2] has been wrapped in Flow. It takes FASTQ files from '*cellranger-atac mkfastq*' and performs ATAC analysis, including read filtering and alignment, barcode counting, identification of transposase cut sites, peak and cell calling, and count matrix generation. Its outputs then become the starting point for downstream analysis of scATAC-Seq data.
To process Chromium Single Cell Multiome ATAC + Gene Expression sequencing data, ‘*cellranger-arc count*’ v2.0\[3] has been wrapped to generate a variety of analyses pertaining to gene expression, chromatin accessibility, and their linkage.

## Importing Cell Ranger - ATAC data in a Partek Flow project

When importing raw reads for processing with the **Cell Ranger - ATAC** task for scATAC-Seq data, the user is not required to specify the data type. If you are importing 10x multiome ATAC + Gene Expression data, first select the ATAC files, choose the data type ATAC-Seq, and complete the import. Once the import task has run successfully, add the gene expression reads to each of the samples. Remember to specify mRNA as the data type during import.

## Running Cell Ranger - ATAC in Flow

To run the **Cell Ranger - ATAC** task for scATAC-Seq data, select the **Unaligned reads** data node, then select **Cell Ranger - ATAC** in the **10x Genomics** section (top panel, Figure 1). For 10x multiome ATAC + Gene Expression data, there will be two data nodes, ATAC-Seq and mRNA, once the FASTQ files have been imported into Flow properly (bottom panel, Figure 1). Users should select the ATAC-Seq data node to trigger the **Cell Ranger - ATAC** task.

Figure 1. Selecting the Cell Ranger - ATAC task for converting FASTQ files to Single cell counts.

As with the **Cell Ranger - Gene Expression** task, a first-time user will be asked to create a Reference assembly. Partek Flow uses *Cell Ranger ARC 2.0.0* to create a Reference assembly for all 10x Genomics analysis pipelines. Please refer to our [Cell Ranger - Gene Expression](/partek-flow/user-manual/task-menu/10x-genomics/cell-ranger-gene-expression.md) task manual for how to build or use a Reference assembly. Once the right assembly has been chosen or provided, simply press the **Finish** button to run the task with default settings. The reference assembly ‘*Homo sapiens (human) - hg38*’ is used as an example here (Figure 2).

Figure 2. Run Cell Ranger - ATAC task in Flow.
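As a rough illustration of what the wrapped pipeline looks like outside Flow, the sketch below assembles a `cellranger-atac count` command line in Python without running it. How Flow invokes the pipeline internally is not described on this page, so treat this only as orientation. The function name, run ID, paths, and sample name are hypothetical; the options used are `--id`, `--reference`, `--fastqs`, `--sample`, and `--localmem` from the 10x command line tool.

```python
from typing import List, Optional


def build_count_command(
    run_id: str,
    reference_dir: str,
    fastq_dir: str,
    sample: Optional[str] = None,
    local_mem_gb: Optional[int] = None,
) -> List[str]:
    """Assemble an argument list for `cellranger-atac count` (not executed here)."""
    cmd = [
        "cellranger-atac",
        "count",
        f"--id={run_id}",
        f"--reference={reference_dir}",
        f"--fastqs={fastq_dir}",
    ]
    if sample is not None:
        # Restrict the run to FASTQ files whose names start with this sample prefix.
        cmd.append(f"--sample={sample}")
    if local_mem_gb is not None:
        # Cap the memory (in GB) that the pipeline stages may use.
        cmd.append(f"--localmem={local_mem_gb}")
    return cmd


# Hypothetical identifiers and paths, for illustration only.
example = build_count_command(
    "run1", "/refs/hg38", "/data/fastqs", sample="pbmc", local_mem_gb=64
)
print(" ".join(example))
```

The `--localmem` value plays the same role as the **Memory limit (GB)** option of the Flow task.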

The interface is different for Single Cell Multiome ATAC + Gene Expression sequencing data because the gene expression data from the very same cell has to be paired with the ATAC-Seq data (Figure 3).

Figure 3. Run Cell Ranger - ATAC task for 10x Single Cell Multiome ATAC and Gene Expression data in Flow.

After the task has finished successfully, a new data node named **Single cell counts** will be displayed (Figure 4). For ATAC-Seq data, this data node contains a filtered peak-barcode count matrix. For multiomic data, it contains a unified feature-barcode matrix that holds gene expression counts alongside ATAC-Seq peak counts for each cell barcode. To open the task report when the task is finished, double-click the output data node, or single-click the data node and select **Task report** in the **Task results** section. The task report (Figure 5) is the same as the ‘Summary HTML’ from the Cell Ranger ATAC output.

Figure 4. The finished Cell Ranger - ATAC task in Flow.

## Cell Ranger - ATAC task report in Flow

The task report is sample based. Users can use the dropdown list on the top left to switch samples. Under the sample name, each report has two tabs: the **Summary** report and the **Data Quality** report (Figure 5). Important information on the **Estimated Number of Cells**, **Median high-quality fragments per cell**, and **Fraction of high-quality fragments overlapping peaks**, as well as information on **Sample**, **Sequencing**, **Cells**, and **Cell Clustering**, is summarized in different panels. The **Barcode Rank** plot and the **Fragment Distribution** plot are also included in the **Cells** section of the **Summary** report (Figure 5). Descriptions of the metrics in the following sections can also be found by clicking the icon next to the section header in the Summary HTML file itself.

Figure 5. The example report of Cell Ranger - ATAC task in Flow.

The **Library Complexity** section in the **Data Quality** report plots the observed per-cell complexity, measured as median unique fragments per cell, as a function of mean reads per cell (Figure 6).

The **Mapping** section displays the **Insert Size Distribution** plot and the metrics derived from it. Single Cell ATAC read pairs produce detailed information about nucleosome packing and positioning, and the fragment length distribution captures the nucleosome positioning periodicity.

The **Targeting** section profiles the chromatin accessibility behavior of the library at epigenetically relevant regions in the genome. The **Enrichment around TSS** plot is helpful for assessing the signal-to-noise ratio of the library: Transcriptional Start Sites (TSSs) and the promoter regions around them are known to have a higher degree of chromatin accessibility than other regions of the genome. The **Peaks targeting** plot presents the variation in the number of on-target fragments, that is, fragments that overlap peaks, within each barcode group. A higher percentage of fragments overlapping peaks is expected for cell-associated barcodes.

Figure 6. Data Quality report of Cell Ranger - ATAC task in Flow.
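To make the idea behind the **Insert Size Distribution** plot concrete, here is a minimal Python sketch that tallies fragment lengths from a fragments file. It assumes the tab-separated layout used by 10x `fragments.tsv.gz` files (chromosome, start, end, barcode, read count) and is not how the report itself is computed; the function names are hypothetical.

```python
import gzip
from collections import Counter


def count_fragment_lengths(lines):
    """Tally fragment lengths (end - start) from tab-separated fragment lines."""
    lengths = Counter()
    for line in lines:
        # Skip blank lines and header/comment lines starting with '#'.
        if not line.strip() or line.startswith("#"):
            continue
        fields = line.rstrip("\n").split("\t")
        start, end = int(fields[1]), int(fields[2])
        lengths[end - start] += 1
    return lengths


def count_fragment_lengths_in_file(path):
    """Open a plain or gzipped fragments file and tally its fragment lengths."""
    opener = gzip.open if str(path).endswith(".gz") else open
    with opener(path, "rt") as handle:
        return count_fragment_lengths(handle)


# Tiny made-up example; real files contain millions of fragments.
demo = [
    "# comment line\n",
    "chr1\t100\t250\tAAAC-1\t2\n",
    "chr1\t300\t450\tAAAG-1\t1\n",
    "chr2\t10\t370\tAAAC-1\t1\n",
]
print(sorted(count_fragment_lengths(demo).items()))
```

Plotting these counts against fragment length gives a histogram in which the nucleosome periodicity mentioned above shows up as repeated peaks.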

The task report for multiomic data analysis is more complex. It contains summary metrics at three levels: ATAC, gene expression, and joint. The Joint view is the default when the summary first renders and can be reached by clicking "Joint" in the top left corner. Metrics that are specific to the Chromatin Accessibility library appear in the ATAC tab, and metrics that are specific to the gene expression library appear in the Gene Expression tab (Figure 7). For details, please refer to the 10x Genomics webpage\[4].

Figure 7. Cell Ranger - ATAC task report for 10x Chromium Single Cell Multiome ATAC + Gene Expression data in Flow.

Other adjustable parameters in this task (Figure 2) include:

* **Subsample percentile**: Downsample to preserve this fraction of reads.

Users can also click **Configure** to change the default settings in **Advanced options** (Figure 2).

* **Override peak caller**: To override the peak caller, users specify the peaks to use in downstream analyses with a supplied 3-column BED file. The supplied peaks file must be sorted by position and must not contain overlapping peaks; comment lines beginning with `#` are allowed.
* **Force cells**: Define the top N barcodes with the most fragments overlapping peaks as cells and override the cell calling algorithm. N must be a positive integer <= 20,000. Use this option if the number of cells estimated by **Cell Ranger - ATAC** is not consistent with the barcode rank plot.
* **Memory limit (GB)**: Restricts **Cell Ranger - ATAC** to the specified amount of memory (in GB) when executing pipeline stages.

## Import Cell Ranger output into Flow

If users have converted FASTQ files to count matrices outside of Partek Flow, the available count matrix can be imported along with additional files (Figure 8A). Flow needs the following files to complete the import:

* *filtered\_feature\_bc\_matrix.h5*
* *per\_barcode\_metrics.csv* (or, under its other name, *singlecell.csv*)
* *peaks.bed*
* *fragments.tsv.gz.tbi*
* *fragments.tsv.gz*

These five files can usually be found in the outs/ subdirectory within the pipeline output directory (Figure 8B). Five files are necessary per sample because scATAC-seq is more complicated than RNA-seq. If peak calling was performed on each sample/dataset independently, the peaks are unlikely to be exactly the same, so all of the samples/datasets need to be merged to create a common set of peaks across them. This merge is performed during data import, which means all of the samples/datasets must be imported at one time, not separately. To add samples, click the green **+** button (Figure 8A).
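Before starting the import, it can help to confirm that each sample's output directory holds all five files. The following Python sketch does that for one directory; the function name and example path are hypothetical, and the file names are the ones listed above.

```python
from pathlib import Path

# Files listed above; the barcode metrics file may carry either name.
REQUIRED_FILES = (
    "filtered_feature_bc_matrix.h5",
    "peaks.bed",
    "fragments.tsv.gz",
    "fragments.tsv.gz.tbi",
)
BARCODE_METRICS_NAMES = ("per_barcode_metrics.csv", "singlecell.csv")


def missing_import_files(outs_dir):
    """Return the names of required files that are absent from outs_dir."""
    outs = Path(outs_dir)
    missing = [name for name in REQUIRED_FILES if not (outs / name).is_file()]
    if not any((outs / name).is_file() for name in BARCODE_METRICS_NAMES):
        missing.append(" or ".join(BARCODE_METRICS_NAMES))
    return missing


# Hypothetical path, for illustration only.
for name in missing_import_files("/data/sample1/outs"):
    print(f"missing: {name}")
```

An empty result means the directory has everything the import dialog asks for; it does not check the contents of the files.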

Figure 8. Single cell ATAC count matrix import in Flow.
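The constraints stated for the **Override peak caller** option (a 3-column BED file, sorted by position, with no overlapping peaks and optional `#` comment lines) can be checked before a file is supplied. The sketch below is one possible check in Python, not the validation that Cell Ranger or Flow performs. The function name is hypothetical, and it does not verify that chromosomes follow the order of the reference.

```python
def check_peaks_bed(lines):
    """Return a list of problems found in 3-column BED lines (empty if none)."""
    problems = []
    seen_chroms = set()
    current_chrom = None
    prev_start = prev_end = None
    for number, line in enumerate(lines, start=1):
        text = line.strip()
        if not text or text.startswith("#"):
            continue  # blank and comment lines are allowed
        fields = text.split()
        if len(fields) != 3:
            problems.append(f"line {number}: expected 3 columns, found {len(fields)}")
            continue
        chrom = fields[0]
        try:
            start, end = int(fields[1]), int(fields[2])
        except ValueError:
            problems.append(f"line {number}: start and end must be integers")
            continue
        if start < 0 or end <= start:
            problems.append(f"line {number}: invalid interval {start}-{end}")
            continue
        if chrom != current_chrom:
            # A chromosome seen earlier must not reappear in a later block.
            if chrom in seen_chroms:
                problems.append(f"line {number}: {chrom} is split across blocks (not sorted)")
            seen_chroms.add(chrom)
            current_chrom, prev_start, prev_end = chrom, start, end
            continue
        if start < prev_start:
            problems.append(f"line {number}: start {start} is before {prev_start} (not sorted)")
        elif start < prev_end:
            # BED intervals are half-open, so start == prev_end is adjacent, not overlapping.
            problems.append(f"line {number}: overlaps previous peak ending at {prev_end}")
        prev_start, prev_end = start, max(prev_end, end)
    return problems


good = ["# peaks\n", "chr1\t100\t200\n", "chr1\t200\t300\n", "chr2\t50\t80\n"]
bad = ["chr1\t100\t200\n", "chr1\t150\t250\n", "chr1\t120\t130\n", "chr2\t5\t9\t0\n"]
print(check_peaks_bed(good))
print(check_peaks_bed(bad))
```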

## Import FASTQs for Cell Ranger - ATAC in Flow

Although the index files (I1 or I2) are optional, we encourage users to include all of the FASTQs in the table (Figure 9) while importing data for Cell Ranger - ATAC.

Figure 9. Input FASTQs for Cell Ranger - ATAC in Flow.
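To see at a glance which read files are present for each sample before import, file names can be grouped by sample and lane. The Python sketch below assumes the files follow the Illumina naming pattern `<sample>_S<n>_L<lane>_<read>_001.fastq.gz`, as produced by mkfastq or bcl2fastq; files named differently are reported as unmatched. The function name and example file names are hypothetical.

```python
import re
from collections import defaultdict

# Assumed naming pattern: <sample>_S<n>_L<lane>_<R1|R2|R3|I1|I2>_001.fastq[.gz]
FASTQ_NAME = re.compile(
    r"^(?P<sample>.+)_S\d+_L(?P<lane>\d{3})_(?P<read>[RI][1-3])_001\.fastq(?:\.gz)?$"
)


def group_fastqs(filenames):
    """Group FASTQ names as {sample: {lane: [read types]}} plus unmatched names."""
    grouped = defaultdict(lambda: defaultdict(set))
    unmatched = []
    for name in filenames:
        match = FASTQ_NAME.match(name)
        if match is None:
            unmatched.append(name)
            continue
        grouped[match["sample"]][match["lane"]].add(match["read"])
    result = {
        sample: {lane: sorted(reads) for lane, reads in lanes.items()}
        for sample, lanes in grouped.items()
    }
    return result, unmatched


files = [
    "pbmc_S1_L001_I1_001.fastq.gz",
    "pbmc_S1_L001_R1_001.fastq.gz",
    "pbmc_S1_L001_R2_001.fastq.gz",
    "pbmc_S1_L001_R3_001.fastq.gz",
    "notes.txt",
]
grouped, unmatched = group_fastqs(files)
print(grouped)
print(unmatched)
```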

Importing ATAC FASTQ files is as straightforward as importing (sc)RNA-Seq data. For 10x multiome ATAC + Gene Expression data, however, the two types of data need to be associated with each other. This takes two steps:

1. Import the ATAC FASTQ files on the new page that opens after clicking "**Automatically create samples from files**" in the *Data* tab (Figure 10A), and select ATAC-Seq as the data type from the dropdown list (Figure 10B).
2. Move back to the *Data* tab and display all of the files by clicking the **Show data files** button at the bottom left of the **Sample name** table. Then click the green **+** button ![](/files/fiTR7clneDwBdQQBm6sS) (Figure 10C) to add the RNA FASTQs to the same sample. Select mRNA as the data type from the dropdown, then finish the import by clicking the **Associate file** button (Figure 10D).

Figure 10. Import FASTQs for 10x multiomics ATAC + GEX data in Flow.

## References

1.
2.
3.
4.

## Additional Assistance

If you need additional assistance, please visit [our support page](http://www.partek.com/support) to submit a help ticket or find phone numbers for regional support.