# Mapping error statistics

## Introduction

When visualizing high throughput data, for example via a 2D or 3D scatter plot, to display the distance of the observations on NGS observations where each observation has thousands of measurements, dimension reduction techniques need to be used. One can apply Principle Component Analysis, t-Distributed Stochastic Neighbor Embedding etc. Those techniques convert data from high dimensional space to lower dimensional space conveying similar information. When information represented in lower dimensional space is not exactly the same as the one from higher dimensional space data, error is generated. In the advanced option settings of dialogs like PCA, t-SNE, there is *Generate mapping error statistics* option, checking this button will output the mapping error on the plot.

## Sammon's mapping error

The distance between the ith and jth data points in the original space is denoted by $$d$$\*$$*{ij}$$, and the distance between the projections in lower dimensional space is denoted by $$d*{ij}$$. Sammon's mapping error (1) is calculated as the follows:

![](/files/WsFeyEiUc7oH0OnarH5w)

To avoid emphasizing small distance, quadratic mapping error is also generated according to the formula:

![](/files/JBottGFfF1y5lCykwF2E)

Both error measures range from zero to plus infinity.

## References

\[1] Sammon JW, 1969, A nonlinear mapping for data structure analysis

## Additional Assistance

If you need additional assistance, please visit [our support page](http://www.partek.com/support) to submit a help ticket or find phone numbers for regional support.


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://help.partek.illumina.com/partek-flow/white-papers/mapping-error-statistics.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
