Supplementary files
S1 [XLSX]—Description of the 5S-IGS reference dataset,
including: unique (non-redundant) and interspecifically shared
(“ambiguous”) sequences; main structural characteristics (length and
GC content) in the different species (with 25th and 75th percentiles);
outlier sequence variants.
S2 [XLSX]—Basic description of the obtained HTS dataset,
including: the number of reads retained after the preprocessing steps;
length and GC content of each HTS sequence with related scatter plots;
details of the distribution of the HTS sequences in the six samples;
BLAST assignations.
S3 [PDF]—RAxML-inferred guide trees based on 1160 5S-IGS
reference sequences, including: unrooted tree with outliers labelled;
annotated subtrees for each Quercus section.
S4 [PDF]—Colored, annotated versions of 24 RAxML trees
inferred for the reference sequences and six HTS samples with four
different abundance cut-offs (2, 5, 10, 25).
S5 [XLSX] —Taxonomic assignations of the HTS sequences
with total abundance >25, obtained using BLAST and EPA in
each sample.