Recommended repo not working
git checkout before_new_augs2
Downloading data/lm/kenlm.scorer (953 MB)
Error downloading object: data/lm/kenlm.scorer (d0cf926): Smudge error: Error downloading data/lm/kenlm.scorer (d0cf926ab9cab54a8a7d70003b931b2d62ebd9105ed392d1ec9c840029867799): batch response: This repository is over its data quota. Account responsible for LFS bandwidth should purchase more data packs to restore access.
Errors logged to /media/pc1/storage/speechRecognition/DeepSpeech/.git/lfs/logs/20201207T123133.983751277.log
Use `git lfs logs last` to view the log.
error: external filter 'git-lfs filter-process' failed
fatal: data/lm/kenlm.scorer: smudge filter lfs failed
Workaround: download the branch manually as a zip: https://github.com/DanBmh/DeepSpeech/archive/before_new_augs2.zip
But now the Docker build fails with a different error:
Step 1/19 : FROM nvcr.io/nvidia/tensorflow:20.03-tf1-py3
---> 8b2abbd886f0
Step 2/19 : ARG DEBIAN_FRONTEND=noninteractive
---> Using cache
---> ebdbcfa71355
Step 3/19 : ENV DEEPSPEECH_REPO=https://github.com/mozilla/DeepSpeech.git
---> Using cache
---> 54bff2468283
Step 4/19 : ENV DEEPSPEECH_SHA=188a6f2c1ee53dc79acf8abceaf729b5f9a05e7a
---> Using cache
---> 8bb2dfbbba27
Step 5/19 : RUN apt-get update && apt-get install -y --no-install-recommends apt-utils bash-completion build-essential curl git git-lfs libbz2-dev locales python3-venv unzip wget
---> Using cache
---> bd8a4b23a2d5
Step 6/19 : RUN apt-get purge -y python3-xdg
---> Using cache
---> 86bcaf75e9c7
Step 7/19 : RUN apt-get install -y --no-install-recommends libopus0 libsndfile1
---> Using cache
---> 40ce3213de36
Step 8/19 : WORKDIR /
---> Using cache
---> 62e68733d4c3
Step 9/19 : RUN wget https://github.com/mozilla/DeepSpeech/archive/${DEEPSPEECH_SHA}.zip && unzip ${DEEPSPEECH_SHA}.zip && mv DeepSpeech-${DEEPSPEECH_SHA} DeepSpeech && rm -rf ${DEEPSPEECH_SHA}.zip
---> Using cache
---> baf62ea93d28
Step 10/19 : WORKDIR /DeepSpeech
---> Using cache
---> dc0187885d2d
Step 11/19 : RUN python3 -m pip install --upgrade pip
---> Using cache
---> 21685e35db36
Step 12/19 : RUN python3 -m pip install --upgrade pip==20.0.2 wheel==0.34.2 setuptools==46.1.3
---> Using cache
---> 9016e538d89a
Step 13/19 : RUN DS_NOTENSORFLOW=y python3 -m pip install --upgrade -e .
---> Using cache
---> 794059d1b37e
Step 14/19 : RUN python3 util/taskcluster.py --source tensorflow --branch r1.15 --artifact convert_graphdef_memmapped_format --target .
---> Using cache
---> daf88e0df8d8
Step 15/19 : RUN chmod +x convert_graphdef_memmapped_format
---> Using cache
---> ea055aa3fc53
Step 16/19 : WORKDIR /DeepSpeech/native_client
---> Using cache
---> 3ae106f8efe6
Step 17/19 : RUN rm -rf kenlm && git clone https://github.com/kpu/kenlm && cd kenlm && git checkout 87e85e66c99ceff1fab2500a7c60c01da7315eec && mkdir -p build && cd build && cmake .. && make -j $(nproc)
---> Using cache
---> 11e856903f06
Step 18/19 : WORKDIR /DeepSpeech/
---> Using cache
---> f679e0c08ba9
Step 19/19 : RUN ./bin/run-ldc93s1.sh
---> Running in 1213e1f5649b
+ '[' '!' -f DeepSpeech.py ']'
+ '[' '!' -f data/ldc93s1/ldc93s1.csv ']'
+ echo 'Downloading and preprocessing LDC93S1 example data, saving in ./data/ldc93s1.'
+ python -u bin/import_ldc93s1.py ./data/ldc93s1
Downloading and preprocessing LDC93S1 example data, saving in ./data/ldc93s1.
No path "./data/ldc93s1" - creating ...
No archive "./data/ldc93s1/LDC93S1.wav" - downloading...
Progress | | N/A% completed
No archive "./data/ldc93s1/LDC93S1.txt" - downloading...
Progress | | N/A% completed
+ '[' -d '' ']'
++ python -c 'from xdg import BaseDirectory as xdg; print(xdg.save_data_path("deepspeech/ldc93s1"))'
+ checkpoint_dir=/root/.local/share/deepspeech/ldc93s1
+ export CUDA_VISIBLE_DEVICES=0
+ CUDA_VISIBLE_DEVICES=0
+ python -u DeepSpeech.py --noshow_progressbar --train_files data/ldc93s1/ldc93s1.csv --test_files data/ldc93s1/ldc93s1.csv --train_batch_size 1 --test_batch_size 1 --n_hidden 100 --epochs 200 --checkpoint_dir /root/.local/share/deepspeech/ldc93s1
Traceback (most recent call last):
File "DeepSpeech.py", line 12, in <module>
ds_train.run_script()
File "/DeepSpeech/training/deepspeech_training/train.py", line 968, in run_script
absl.app.run(main)
File "/usr/local/lib/python3.6/dist-packages/absl/app.py", line 299, in run
_run_main(main, args)
File "/usr/local/lib/python3.6/dist-packages/absl/app.py", line 250, in _run_main
sys.exit(main(argv))
File "/DeepSpeech/training/deepspeech_training/train.py", line 935, in main
early_training_checks()
File "/DeepSpeech/training/deepspeech_training/train.py", line 920, in early_training_checks
FLAGS.scorer_path, Config.alphabet)
File "/usr/local/lib/python3.6/dist-packages/ds_ctcdecoder/__init__.py", line 42, in __init__
raise ValueError('Scorer initialization failed with error code {}'.format(err))
ValueError: Scorer initialization failed with error code 8198
swig/python detected a memory leak of type 'Alphabet *', no destructor found.
The command '/bin/sh -c ./bin/run-ldc93s1.sh' returned a non-zero code: 1
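One thing that might be worth checking (I have not confirmed it is the cause): whether data/lm/kenlm.scorer in the build is the real ~953 MB file or only a small Git LFS pointer stub. A pointer file is a short text file whose first line is "version https://git-lfs.github.com/spec/v1". The sketch below assumes the default scorer location data/lm/kenlm.scorer; the helper name is_lfs_pointer is made up for illustration.

```python
from pathlib import Path

# First line of every Git LFS pointer file, per the LFS pointer spec.
LFS_POINTER_PREFIX = b"version https://git-lfs.github.com/spec/v1"


def is_lfs_pointer(path):
    """Return True if the file looks like a Git LFS pointer stub, not real data.

    Hypothetical helper for illustration; not part of DeepSpeech.
    """
    p = Path(path)
    # Pointer stubs are tiny text files; anything larger is treated as real data.
    if not p.is_file() or p.stat().st_size > 1024:
        return False
    with p.open("rb") as f:
        return f.read(len(LFS_POINTER_PREFIX)) == LFS_POINTER_PREFIX


if __name__ == "__main__":
    # Assumed path: the scorer location shown in the LFS error above.
    scorer = Path("data/lm/kenlm.scorer")
    if not scorer.exists():
        print(f"{scorer} does not exist")
    elif is_lfs_pointer(scorer):
        print(f"{scorer} is an LFS pointer stub, not the real scorer")
    else:
        print(f"{scorer} is {scorer.stat().st_size} bytes and is not an LFS pointer")
```

If it reports a pointer stub, the real scorer would have to be obtained some other way and copied to that path before running bin/run-ldc93s1.sh.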
Edited by Pablo