Output file questions
Hello, I had a question about the output following the BUSCO analysis I performed on a genome.
Here is a head of the coordinates.tsv file :
EOG091B0006 scaffold_84 144 48962
EOG091B0007 scaffold_6317 0 17686
EOG091B000H scaffold_384 0 49488
EOG091B000M scaffold_612 101 44632
and of the full_table.tsv file:
Busco id Status Contig Start End Score Length
EOG091B0006 Complete scaffold_84 9626 40090 11575.7 6724
EOG091B0007 Fragmented scaffold_9462 455 4244 1660.7 935
EOG091B000H Complete scaffold_384 4456 43328 7631.6 5014
EOG091B000M Complete scaffold_612 616 36943 7219.5 3780
So I have three questions:
-
We can see that the coordinates of Busco hits along the genome are not the same between the two files for the same Busco hit; Ex: EOG091B0006 [144-48962] vs EOG091B0006 [9626-40090]. Why such a difference?
-
For my analysis I need the information relative to the insertion strand of the sequence, I imagine that these coordinates give the direction of insertion on the positive strand, right?
Thank you very much for your answers.
Edited by Grendel