ERROR: Metaeuk did not recognize any genes matching the dataset embryophyta_odb10 in the input file. If this is unexpected, check your input file and your installation of Metaeuk
I'm having an issue with embryophyta_odb10, similar to another issue that was posted here before, although without the same resolution. What's strange is that the issue is not consistent. I've tried running on a genome and most of the time I will get the error (Metaeuk did not recognize any genes matching the dataset embryophyta_odb10 in the input file. If this is unexpected, check your input file and your installation of Metaeuk), but I have managed to get it to work once or twice. Unfortunately, I am working with very large genomes and sometimes it runs for up to two days before failing. Also, sometimes it fails on the first phase, before running HMMER, and sometimes it fails after running HMMER in the second phase.
I get one of two errors in the metaeuk error log: Can not allocate index memory in DBReader
or
terminate called after throwing an instance of 'std::bad_alloc' what(): std::bad_alloc /pool/home/tshalev/pg29_genome_busco/run_embryophyta_odb10/metaeuk_output/tmp/1565081466817722409/tmp_predict/2025072426226136171/predictexons.sh: line 44: 73670 Aborted "MMSEQS" extractorfs "
{INPUT_CONTIGS}" "${TMP_PATH}/nucl_6f" ${EXTRACTORFS_PAR}
Maybe I'm running out of memory? But we have a substantial amount of RAM on our system. How much memory should BUSCO be using for a ~25 Gb genome?
Thanks for your help!