BUSCO extracted protein files faa.1, faa.2, faa.3: each file contains multiple sequences
Hi dear BUSCO community,
This may not be new, but I couldn't find an answer. I ran BUSCO on the genome of one individual and got the single-copy gene output plus the Augustus results for the extracted proteins. Each single-copy gene .faa file contains one sequence. For the duplicated genes, however, the files xxx.faa.1, xxx.faa.2, and xxx.faa.3 each contain multiple sequences. Does anyone know what faa.1, faa.2, and faa.3 mean, and what the multiple sequences inside each FASTA file represent? I would like to construct a phylogenetic tree for each duplicated gene, but I have no idea how I should concatenate and align these sequences. I would really appreciate any suggestions.
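To make the question concrete, here is a minimal sketch of how the sequences for one duplicated gene could be collected into a single FASTA file before alignment. It does not answer what the .1/.2/.3 suffixes mean. It assumes the files are named `<busco_id>.faa` or `<busco_id>.faa.<n>` as described above, and the function names and the directory passed in are hypothetical. Each record header is prefixed with the suffix of the file it came from, so the origin of every sequence stays visible in the alignment and the tree.

```python
import re
from collections import defaultdict
from pathlib import Path

# Assumed naming scheme: <busco_id>.faa (single copy) or <busco_id>.faa.<n> (duplicated).
FAA_NAME = re.compile(r"^(?P<busco_id>.+?)\.faa(?:\.(?P<copy>\d+))?$")


def read_fasta(text):
    """Parse FASTA text into a list of (header, sequence) tuples."""
    records = []
    header, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                records.append((header, "".join(chunks)))
            header, chunks = line[1:], []
        elif header is not None:
            chunks.append(line)
    if header is not None:
        records.append((header, "".join(chunks)))
    return records


def pool_copies(directory):
    """Group all records by BUSCO ID, tagging each with its source file suffix."""
    pooled = defaultdict(list)
    for path in sorted(Path(directory).iterdir()):
        match = FAA_NAME.match(path.name)
        if not match or not path.is_file():
            continue
        busco_id = match.group("busco_id")
        copy = match.group("copy") or "0"  # "0" marks a file with no numeric suffix
        for i, (header, seq) in enumerate(read_fasta(path.read_text()), start=1):
            pooled[busco_id].append((f"{busco_id}|copy{copy}|seq{i} {header}", seq))
    return dict(pooled)


def write_fasta(records, out_path):
    """Write (header, sequence) tuples to a FASTA file, one sequence per line."""
    with open(out_path, "w") as handle:
        for header, seq in records:
            handle.write(f">{header}\n{seq}\n")
```

Calling `pool_copies("path/to/extracted_proteins")` (a placeholder path) returns a dictionary mapping each BUSCO ID to its list of records, and the length of each list shows how many sequences were found per gene. A pooled file written with `write_fasta` could then be given to a multiple sequence aligner such as MAFFT, one file per gene. Concatenation is normally used to join alignments of different genes into one matrix, so a tree for a single duplicated gene would more likely need one alignment of all its copies.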