busco activity

pwkooij opened issue #750: busco stopped working with No module named 'busco' message at ezlab / busco

2024-06-28T21:06:29Z

I used busco about a week ago without any problems. Today I tried to run it (after installing blobtool2 in its own environment) and I receive the following message"

No module named 'busco'
There was a problem installing BUSCO or importing one of its dependencies. See the user guide and the GitLab issue board (https://gitlab.com/ezlab/busco/issues) if you need further assistance.

I tried uninstalling and installing again. I even tried downgrading to 5.6 and 5.5, but I keep receiving the same message. I now also created a new environment for busco, and tried to install it following the instructions on the main site, but still the same.

Any other suggestions of what I could try?

Matthew Berkeley commented on issue #749 at ezlab / busco

2024-06-27T08:50:40Z

Hi, sorry about that. There was a server fault. Things should be working again now.

Coles-DW commented on issue #749 at ezlab / busco

2024-06-27T06:13:01Z

I am having a similar issue. I am running a BUSCO analysis to assess genome completeness. I am getting the error: Cannot reach https://busco-data2.ezlab.org/v5/data/file_versions.tsv

wei shen opened issue #749: Cannot reach https://busco-data2.ezlab.org/v5/data/file_versions.tsv at ezlab / busco

2024-06-27T02:52:15Z

I am experiencing problems with the website and failed to download the file_versions.tsv today. I am trying to train Augustus with BUSCO, and this issue is interrupting my progress. Could you please provide some advice?

Yibo Tong commented on issue #742 at ezlab / busco

2024-06-26T03:36:42Z

use --metaeuk maybe helpful. It is more stable than miniprot.

Marta Benegas Coll opened issue #748: full_table.tsv doesn't keep original contig IDs in the Sequence column at ezlab / busco

2024-06-25T10:16:23Z

Hi! I'm using BUSCO v5.6.0 in genome mode with a prokaryotic organism. I'm running busco with a de novo assembly result. I have two contigs named "contig_2" and "contig_4". However, the busco result only contains "contig" in the "Sequence" column on the full_table.tsv results. This doesn't happen with other eukaryotic organisms that I tried.

The command used and the stdout:

root@b095248dfc43:/app# busco -i /data/input/assembly.fasta -l /data/input/bacillales_odb10 -m genome -c 4 -e 1e-03 -o busco_output --offline
2024-06-25 10:06:49 INFO:	***** Start a BUSCO v5.6.0 analysis, current time: 06/25/2024 10:06:49 *****
2024-06-25 10:06:49 INFO:	Configuring BUSCO with /busco-5.6.0/config/config.ini
2024-06-25 10:06:49 INFO:	Running genome mode
2024-06-25 10:06:49 INFO:	Input file is /data/input/assembly.fasta
2024-06-25 10:06:49 INFO:	Using local lineages directory /data/input/bacillales_odb10
2024-06-25 10:06:49 WARNING:	Option evalue was provided but is not used in the selected run mode, prok_genome_prod
2024-06-25 10:06:49 INFO:	Running BUSCO using lineage dataset bacillales_odb10 (prokaryota, 2024-01-08)
2024-06-25 10:06:49 INFO:	Running 1 job(s) on bbtools, starting at 06/25/2024 10:06:49
2024-06-25 10:06:49 INFO:	[bbtools]	1 of 1 task(s) completed
2024-06-25 10:06:49 INFO:	***** Run Prodigal on input to predict and extract genes *****
2024-06-25 10:06:49 INFO:	Running Prodigal with genetic code 11 in single mode
2024-06-25 10:06:49 INFO:	Running 1 job(s) on prodigal, starting at 06/25/2024 10:06:49
2024-06-25 10:06:54 INFO:	[prodigal]	1 of 1 task(s) completed
2024-06-25 10:06:54 INFO:	Genetic code 11 selected as optimal
2024-06-25 10:06:54 INFO:	***** Run HMMER on gene sequences *****
2024-06-25 10:06:54 INFO:	Running 450 job(s) on hmmsearch, starting at 06/25/2024 10:06:54
2024-06-25 10:06:54 INFO:	[hmmsearch]	45 of 450 task(s) completed
2024-06-25 10:06:54 INFO:	[hmmsearch]	90 of 450 task(s) completed
2024-06-25 10:06:55 INFO:	[hmmsearch]	135 of 450 task(s) completed
2024-06-25 10:06:55 INFO:	[hmmsearch]	180 of 450 task(s) completed
2024-06-25 10:06:56 INFO:	[hmmsearch]	225 of 450 task(s) completed
2024-06-25 10:06:57 INFO:	[hmmsearch]	270 of 450 task(s) completed
2024-06-25 10:06:57 INFO:	[hmmsearch]	315 of 450 task(s) completed
2024-06-25 10:06:59 INFO:	[hmmsearch]	360 of 450 task(s) completed
2024-06-25 10:06:59 INFO:	[hmmsearch]	405 of 450 task(s) completed
2024-06-25 10:07:00 INFO:	[hmmsearch]	450 of 450 task(s) completed
2024-06-25 10:07:00 INFO:	Results:	C:96.9%[S:96.0%,D:0.9%],F:2.7%,M:0.4%,n:450	   

2024-06-25 10:07:01 INFO:	

    ---------------------------------------------------
    |Results from dataset bacillales_odb10             |
    ---------------------------------------------------
    |C:96.9%[S:96.0%,D:0.9%],F:2.7%,M:0.4%,n:450       |
    |436    Complete BUSCOs (C)                        |
    |432    Complete and single-copy BUSCOs (S)        |
    |4    Complete and duplicated BUSCOs (D)           |
    |12    Fragmented BUSCOs (F)                       |
    |2    Missing BUSCOs (M)                           |
    |450    Total BUSCO groups searched                |
    ---------------------------------------------------
2024-06-25 10:07:01 INFO:	BUSCO analysis done with WARNING(s). Total running time: 12 seconds

***** Summary of warnings: *****
2024-06-25 10:06:49 WARNING:busco.BuscoConfig	Option evalue was provided but is not used in the selected run mode, prok_genome_prod

2024-06-25 10:07:01 INFO:	Results written in /app/busco_output
2024-06-25 10:07:01 INFO:	For assistance with interpreting the results, please consult the userguide: https://busco.ezlab.org/busco_userguide.html

2024-06-25 10:07:01 INFO:	Visit this page https://gitlab.com/ezlab/busco#how-to-cite-busco to see how to cite BUSCO

The input data and the output: assembly.fasta full_table.tsv busco_output.zip

Yibo Tong commented on issue #747 at ezlab / busco

2024-06-25T08:38:37Z

It not sometimes, I check all output with 200+ species. Use BUSCO 5.7.1.

Yibo Tong opened issue #747: bbtools output error at ezlab / busco

2024-06-25T08:33:26Z

It just a bug, that sometimes N50 and L50 are reversed.

Main genome scaffold N/L50: 8/138966237 Main genome contig N/L50: 15/48231277 Main genome scaffold N/L90: 16/69359453 Main genome contig N/L90: 54/12256021

It should be 48231277/15.

The genome is ensembl v102 Sus_scrofa.Sscrofa11.1

Rousseau Coralie commented on issue #737 at ezlab / busco

2024-06-20T09:18:50Z

Hi, The links are still not available. Do you find any alternative solutions for that maybe? I'm interested 😄

Best, Coralie

Fuyou Fu commented on issue #731 at ezlab / busco

2024-06-16T15:07:14Z

Did you solve this problem? I have the same issue.

Han Qu closed issue #736: Inquiry about Chromosome Location and Gene Identification in Primate Database at ezlab / busco

2024-06-11T14:57:41Z

I would like to express my appreciation for the amazing software that you have developed. I have a query regarding the targeting of chromosome locations for genes, particularly in the primates_db10 dataset.

I am interested in knowing how I can determine the specific location of each gene on the human chromosome. Is there a way to find the entrezid/gene symbol for each gene instead of the busco id?

Thank you for your time!

金马驭 commented on issue #742 at ezlab / busco

2024-06-04T02:09:59Z

@abhisheknayak389 Thank you for your advise. I only change CPU to redo one of genomes(GCF_000298355.1) and that time it seems not work. I will try to redo all failed genomes with 1 cpu, hope things will be different.Because it really takes a long time. Tanks again!

Abhishek Nayak commented on issue #742 at ezlab / busco

2024-06-03T13:32:02Z

Hello @Marry_King, I tried running BUSCO using only the single cpu (without specifying any cpu's in the command) and it was executed with out any error. Please try to do the same and hopefully it will run successfully.

金马驭 commented on issue #742 at ezlab / busco

2024-06-02T08:08:47Z

I have the same error with you. I run 293 genomes and 32 of them report this error including GCA_004026885.1_MegLyr_v1. A few of them successed while I change mammalia_odb10 to the most specific lineage(cetartiodactyla_odb10 etc.), but the others still failed. I wonder if this error relate to the numbers of contigs because most of failed genomes in my dataset are scaffold level and have more contigs than successed genomes. But I still have no idea to deal with it.

Vamsi Kodali opened issue #746: BUSCO 5.7.1 using conda returns error at ezlab / busco

2024-05-29T20:11:37Z

I installed BUSCO using conda following in the instructions from BUSCO website. The installation itself went well without any errors but after the installation finished, and I try to run BUSCO, I get the following error:

Unable to find module Literal. Please make sure it is installed. See the user guide and the GitLab issue board (https://gitlab.com/ezlab/busco/issues) if you need further assistance.

It appears that one of the dependencies requires python-3.7.12; I don't know which. Is there a way to require python >=3.8 for busco?

Meredith Meyer opened issue #745: High BUSCO Score at ezlab / busco

2024-05-23T16:27:57Z

I ran some phytoplankton transcriptomes using the chlorophyta db and got a combined assembly BUSCO score (single and duplicated BUSCOs) of 99%, but when I run my 5 samples individually, all BUSCO scores are 30-40%. The discrepancy is curious to me. Does anyone know whether this is concerning? Should I focus just on the single BUSCOs and remove the duplicated? I've rerun BUSCO a couple times and gotten the same thing. Thanks!!

Matthew Berkeley commented on issue #744 at ezlab / busco

2024-05-22T09:50:45Z

Hi, the error message in the Miniprot error log suggests either a problem with your Miniprot installation or your input file. I suggest re-installing Miniprot and trying again. Let me know if that fixes the problem. If not I can take a closer look.

Sofia Marques-Hill opened issue #744: single genome run ERROR 'local variable 'target_id' referenced before assignment' at ezlab / busco

2024-05-21T16:19:20Z

Hello, I'm running BUSCO for the first time and I'm following the steps from the user guide.
It is a plant genome so I ran:
~$ busco -i Cirsium_arvense.reference.fasta -l eudicots_odb10 -o CirAr_hap1_BUSCO -m genome. And I'm having the following error:

2024-05-21 09:47:33 INFO: ***** Start a BUSCO v5.7.1 analysis, current time: 05/21/2024 09:47:33 *****
2024-05-21 09:47:33 INFO: Configuring BUSCO with local environment
2024-05-21 09:47:33 INFO: Running genome mode
2024-05-21 09:47:33 INFO: Downloading information on latest versions of BUSCO data...
2024-05-21 09:47:36 INFO: Input file is /annotation/Cirsium_arvense.reference.softmask.fasta 2024-05-21 09:47:36 INFO: Downloading file 'https://busco-data.ezlab.org/v5/data/lineages/eudicots_odb10.2024-01-08.tar.gz' 2024-05-21 09:47:44 INFO: Decompressing file '/annotation/busco_downloads/lineages/eudicots_odb10.tar.gz' 2024-05-21 09:48:09 INFO: Running BUSCO using lineage dataset eudicots_odb10 (eukaryota, 2024-01-08) 2024-05-21 09:48:09 INFO: Running 1 job(s) on bbtools, starting at 05/21/2024 09:48:09 2024-05-21 09:48:17 INFO: [bbtools] 1 of 1 task(s) completed
2024-05-21 09:48:17 INFO: Running 1 job(s) on miniprot_index, starting at 05/21/2024 09:48:17
2024-05-21 09:48:42 INFO: [miniprot_index] 1 of 1 task(s) completed
2024-05-21 09:48:43 INFO: Running 1 job(s) on miniprot_align, starting at 05/21/2024 09:48:43
2024-05-21 09:48:43 INFO: [miniprot_align] 1 of 1 task(s) completed
2024-05-21 09:48:43 CRITICAL: Unhandled exception occurred:
Traceback (most recent call last):
File "/envs/annotation_tools/lib/python3.7/site-packages/busco/BuscoRunner.py", line 165, in run
self.runner.run_analysis()
File "/envs/annotation_tools/lib/python3.7/site-packages/busco/BuscoRunner.py", line 564, in run_analysis
self.analysis.run_analysis()
File "/envs/annotation_tools/lib/python3.7/site-packages/busco/analysis/GenomeAnalysis.py", line 1043, in run_analysis
self.run_miniprot(incomplete_buscos)
File "/envs/annotation_tools/lib/python3.7/site-packages/busco/analysis/GenomeAnalysis.py", line 1071, in run_miniprot
self.miniprot_align_runner.parse_output()
File "/envs/annotation_tools/lib/python3.7/site-packages/busco/busco_tools/miniprot.py", line 370, in parse_output
target_id,
UnboundLocalError: local variable 'target_id' referenced before assignment

2024-05-21 09:48:43 ERROR: local variable 'target_id' referenced before assignment
2024-05-21 09:48:43 ERROR: BUSCO analysis failed!
2024-05-21 09:48:43 ERROR: Check the logs, read the user guide (https://busco.ezlab.org/busco_userguide.html), and check the BUSCO issue board on https://gitlab.com/ezlab/busco/issues

In the error log files the only one that says something else is the miniprot_align_eudicots_odb10_err.log:

[ERROR] failed to open/build the index

Does somebody have an idea of what I'm doing wrong?
Thanks, -Sofia

GianBen opened issue #743: BUSCO analysis failed at ezlab / busco

2024-05-20T16:02:59Z

Hello,

I am trying to run BUSCO on a fungal genome using the fungi_odb10 but I am having this error below. I checked the db and Metaeuk and both seem ok.

version of Busco: BUSCO 5.2.2
2024-05-17 16:18:53 INFO:	***** Start a BUSCO v5.2.2 analysis, current time: 05/17/2024 16:18:53 *****
2024-05-17 16:18:53 INFO:	Configuring BUSCO with local environment
2024-05-17 16:18:53 INFO:	Mode is genome
2024-05-17 16:18:53 INFO:	Downloading information on latest versions of BUSCO data...
2024-05-17 16:18:54 INFO:	Input file is /mnt/home/benucci/project_PleurotusMartina24/reference/GCA_029467805.1_ASM2946780v1_genomic.fna
2024-05-17 16:18:54 INFO:	Using local lineages directory /mnt/research/bonito_lab/DATABASES/BUSCO_fungi/fungi_odb10/
2024-05-17 16:18:54 INFO:	Running BUSCO using lineage dataset  (eukaryota, 2024-01-08)
2024-05-17 16:18:54 INFO:	Running 1 job(s) on metaeuk, starting at 05/17/2024 16:18:54
2024-05-17 16:18:56 INFO:	[metaeuk]	1 of 1 task(s) completed
2024-05-17 16:18:56 ERROR:	Metaeuk did not recognize any genes matching the dataset  in the input file. If this is unexpected, check your input file and your installation of Metaeuk

2024-05-17 16:18:56 ERROR:	BUSCO analysis failed !
2024-05-17 16:18:56 ERROR:	Check the logs, read the user guide (https://busco.ezlab.org/busco_userguide.html), and check the BUSCO issue board on https://gitlab.com/ezlab/busco/issues

Thanks,

Gian

Abhishek Nayak commented on issue #742 at ezlab / busco

2024-05-20T15:27:54Z

I have tried running below from the paper BUSCO: Assessing Genomic Data Quality and Beyond - work
busco -i Tglobosa_GCF_014133895.1_genome.fna -l saccharomycetes_odb10 -m geno -o busco_out_Tglob_genome -c 12 However I tried rerunning using - busco -i GCA_004026885.1_MegLyr_v1_BIUU_genomic.fna -l mammalia_odb10 -m geno -o megaderma_output -c 12 -r

Throws the same error