Basic Statistics
| Measure | Value |
|---|---|
| Filename | mono6_n5_can_atra.prinseq.fastq |
| File type | Conventional base calls |
| Encoding | Sanger / Illumina 1.9 |
| Total Sequences | 20647138 |
| Sequences flagged as poor quality | 0 |
| Sequence length | 20-371 |
| %GC | 51 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
| Sequence | Count | Percentage | Possible Source |
|---|---|---|---|
| AAAGCAGGCCCGAGCCGCCTGGATACCGCAGCTAGGAATAATGGAATAGG | 81964 | 0.3969751158732024 | No Hit |
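
The Percentage value above is consistent with the sequence count divided by Total Sequences from Basic Statistics, times 100. A minimal check of that relationship, assuming that formula and using only the two numbers reported here (the variable names are illustrative and not part of FastQC):

```python
# Values copied from the report tables above.
total_sequences = 20647138  # "Total Sequences" in Basic Statistics
overrep_count = 81964       # "Count" for the overrepresented sequence

# Assumption: Percentage = Count / Total Sequences * 100.
percentage = overrep_count / total_sequences * 100

print(f"{percentage:.4f}%")
```

This prints `0.3970%`, which agrees with the reported 0.3969751158732024 to the displayed precision.
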
Adapter Content
Kmer Content
| Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
|---|---|---|---|---|
| ACGTACG | 2735 | 1.1518205E-7 | 27756.467 | 360-364 |
| GTACGTC | 2355 | 1.5224941E-9 | 21490.145 | 360-364 |
| TACGTCT | 1930 | 2.1937012E-9 | 19666.82 | 360-364 |
| CGTACGT | 4710 | 2.8976501E-9 | 16117.608 | 360-364 |
| AGCATCG | 5690 | 1.0368949E-6 | 13341.64 | 350-359 |
| TCGATCG | 1140 | 8.341885E-9 | 13318.234 | 310-319 |
| GATGTAC | 7085 | 2.0015377E-6 | 10714.74 | 360-364 |
| ATCGATC | 1845 | 3.5361154E-8 | 10286.441 | 350-359 |
| TACGTAC | 1905 | 1.8189894E-12 | 7969.967 | 290-299 |
| GATCGAT | 2120 | 5.3647454E-8 | 7161.692 | 310-319 |
| GCATCGA | 5825 | 0.0 | 6516.218 | 350-359 |
| CTACGTA | 2645 | 0.0 | 6377.982 | 350-359 |
| CGATGTA | 2455 | 8.330608E-8 | 6184.4346 | 310-319 |
| CATCGAT | 4860 | 6.461578E-7 | 5206.717 | 350-359 |
| CGGAATT | 2340 | 2.957411E-4 | 5122.398 | 280-289 |
| GCTAACG | 880 | 4.625133E-4 | 4174.1533 | 270-279 |
| ATGTACA | 10785 | 7.057677E-6 | 3519.422 | 360-364 |
| CTTCGCA | 3415 | 6.298515E-4 | 3509.93 | 280-289 |
| GATCGTA | 1070 | 6.8377296E-4 | 3432.9485 | 270-279 |
| CGATCGA | 4600 | 5.47916E-7 | 3300.6057 | 310-319 |