Basic Statistics
Measure | Value |
---|---|
Filename | mono6_n5_can_atra.prinseq.fastq |
File type | Conventional base calls |
Encoding | Sanger / Illumina 1.9 |
Total Sequences | 20647138 |
Sequences flagged as poor quality | 0 |
Sequence length | 20-371 |
%GC | 51 |
Per base sequence quality
Per sequence quality scores
Per base sequence content
Per sequence GC content
Per base N content
Sequence Length Distribution
Sequence Duplication Levels
Overrepresented sequences
Sequence | Count | Percentage | Possible Source |
---|---|---|---|
AAAGCAGGCCCGAGCCGCCTGGATACCGCAGCTAGGAATAATGGAATAGG | 81964 | 0.3969751158732024 | No Hit |
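
The Percentage value above is consistent with Count divided by Total Sequences from Basic Statistics. A minimal sketch of that check, assuming this relationship holds (the variable names are illustrative and not part of the report):

```python
# Values copied from the tables in this report.
total_sequences = 20647138  # "Total Sequences" in Basic Statistics
count = 81964               # "Count" of the overrepresented sequence

# Assumption: Percentage = Count / Total Sequences * 100.
percentage = count / total_sequences * 100

# Print with four decimals for comparison with the Percentage column.
print(f"{percentage:.4f}%")
```
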
Adapter Content
Kmer Content
Sequence | Count | PValue | Obs/Exp Max | Max Obs/Exp Position |
---|---|---|---|---|
ACGTACG | 2735 | 1.1518205E-7 | 27756.467 | 360-364 |
GTACGTC | 2355 | 1.5224941E-9 | 21490.145 | 360-364 |
TACGTCT | 1930 | 2.1937012E-9 | 19666.82 | 360-364 |
CGTACGT | 4710 | 2.8976501E-9 | 16117.608 | 360-364 |
AGCATCG | 5690 | 1.0368949E-6 | 13341.64 | 350-359 |
TCGATCG | 1140 | 8.341885E-9 | 13318.234 | 310-319 |
GATGTAC | 7085 | 2.0015377E-6 | 10714.74 | 360-364 |
ATCGATC | 1845 | 3.5361154E-8 | 10286.441 | 350-359 |
TACGTAC | 1905 | 1.8189894E-12 | 7969.967 | 290-299 |
GATCGAT | 2120 | 5.3647454E-8 | 7161.692 | 310-319 |
GCATCGA | 5825 | 0.0 | 6516.218 | 350-359 |
CTACGTA | 2645 | 0.0 | 6377.982 | 350-359 |
CGATGTA | 2455 | 8.330608E-8 | 6184.4346 | 310-319 |
CATCGAT | 4860 | 6.461578E-7 | 5206.717 | 350-359 |
CGGAATT | 2340 | 2.957411E-4 | 5122.398 | 280-289 |
GCTAACG | 880 | 4.625133E-4 | 4174.1533 | 270-279 |
ATGTACA | 10785 | 7.057677E-6 | 3519.422 | 360-364 |
CTTCGCA | 3415 | 6.298515E-4 | 3509.93 | 280-289 |
GATCGTA | 1070 | 6.8377296E-4 | 3432.9485 | 270-279 |
CGATCGA | 4600 | 5.47916E-7 | 3300.6057 | 310-319 |