Basic Statistics
Measure | Value |
---|---|
Filename | mono6_n2_can_atra.prinseq.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 16739774 |
Sequences flagged as poor quality | 0 |
Sequence length | 20-373 |
%GC | 50 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AAAGCAGGCCCGAGCCGCCTGGATACCGCAGCTAGGAATAATGGAATAGG | 80803 | 0.48270066250595733 | No Hit |
AGACGGACCAGAGCGAAAGCATTTGCCAAGAATGTTTTCATTAATCAAGA | 45435 | 0.27141943493382886 | No Hit |
GAGGATCCATTGGAGGGCAAGTCTGGTGCCAGCAGCCGCGGTAATTCCAG | 19801 | 0.11828714055518313 | No Hit |
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
CTACGTA | 1970 | 0.0 | 86682.45 | 350-359 |
CGATGTA | 2295 | 0.0 | 74407.164 | 340-349 |
TACGTAC | 1490 | 1.2473774E-6 | 57303.5 | 350-359 |
TCGATCG | 695 | 4.0708146E-6 | 40950.703 | 330-339 |
ACGTACG | 1620 | 4.4235767E-6 | 35136.715 | 350-359 |
ATCGATC | 1330 | 9.938518E-6 | 25678.863 | 330-339 |
GTACGTC | 1845 | 1.9125306E-5 | 18511.05 | 350-359 |
TACGTCT | 1835 | 2.8377624E-5 | 15509.939 | 350-359 |
GCATCGA | 3725 | 0.0 | 15280.935 | 330-339 |
CGTACGT | 2870 | 1.5734258E-9 | 14874.951 | 350-359 |
GATCGAT | 1500 | 3.5395686E-5 | 14230.37 | 330-339 |
GATGTAC | 6055 | 2.059934E-5 | 14101.11 | 340-349 |
CATCGAT | 3040 | 3.1153955E-5 | 14043.128 | 330-339 |
CGATCGA | 2275 | 6.106437E-5 | 10723.042 | 330-339 |
AGCATCG | 4400 | 6.526295E-5 | 9702.525 | 360-363 |
TGAGCAT | 9220 | 4.7762485E-5 | 9260.545 | 360-363 |
CTGAGCA | 18895 | 0.0 | 9037.546 | 360-363 |
GTACAGC | 6200 | 2.1596176E-4 | 5508.531 | 340-349 |
ATGTACA | 10435 | 1.8353271E-4 | 5454.862 | 340-349 |
CTGTAGT | 6505 | 2.3773141E-4 | 5250.252 | 280-289 |