Supplementary files

S1 [XLSX]—Description of the 5S-IGS reference dataset, including: unique (non-redundant) and interspecifically shared (“ambiguous”) sequences; main structural characteristics (length and GC content) in the different species (with 25th and 75th percentiles); outlier sequence variants.
S2 [XLSX]—Basic description of the obtained HTS dataset, including: the number of reads retained after the preprocessing steps; length and GC content of each HTS sequence with related scatter plots; details of the distribution of the HTS sequences in the six samples; BLAST assignations.
S3 [PDF]­—RAxML-inferred guide trees based on 1160 5S-IGS reference sequences, including: unrooted tree with outliers labelled; annotated subtrees for each Quercus section.
S4 [PDF]—Colored, annotated versions of 24 RAxML trees inferred for the reference sequences and six HTS samples with four different abundance cut-offs (2, 5, 10, 25).
S5 [XLSX] —Taxonomic assignations of the HTS sequences with total abundance >25, obtained using BLAST and EPA in each sample.