FastQCFastQC Report
Thu 30 Mar 2017
SRR5229450.subset.fastq.gz

Summary

[OK]Basic Statistics

MeasureValue
FilenameSRR5229450.subset.fastq.gz
File typeConventional base calls
EncodingSanger / Illumina 1.9
Total Sequences100000
Sequences flagged as poor quality0
Sequence length32-76
%GC56

[OK]Per base sequence quality

Per base quality graph

[OK]Per sequence quality scores

Per Sequence quality graph

[FAIL]Per base sequence content

Per base sequence content

[OK]Per sequence GC content

Per sequence GC content graph

[OK]Per base N content

N content graph

[WARN]Sequence Length Distribution

Sequence length distribution

[OK]Sequence Duplication Levels

Duplication level graph

[WARN]Overrepresented sequences

SequenceCountPercentagePossible Source
GATCGGAAGAGCACACGTCTGAACTCCAGTCACAGTCAACAATCTCGTATGCCGTCTTCTGCTTGAAAAAAAAA1100.11TruSeq Adapter, Index 13 (97% over 40bp)

[OK]Adapter Content

Adapter graph

[WARN]Kmer Content

Kmer graph

SequenceCountPValueObs/Exp MaxMax Obs/Exp Position
GATCGGA150.002285250269.482251
CTCGTAT200.007111912652.1638744
TCGTATG200.007111912652.1638745
TAATTGG200.007126015652.1377626
ACACGTC200.00714013952.1116813
GTCAACA306.041775E-446.3678835
GCCGTCT450.004405119530.94291551
AGAGTCC500.007210778527.97482565
TGTTGGC750.00998751926.1572570
GAGTCGG800.002695404521.81140160
GGAGTCG900.00534826519.37817459
GGGGAAG2050.003498957711.8628221