Scheduled Maintenance: We are aware of an issue with Google, AOL, and Yahoo services as email providers which are blocking new registrations. We are trying to fix the issue and we have several internal and external support tickets in process to resolve the issue. Please see: viewtopic.php?t=158230
[Software] How to install Tesseract OCR in Debian 10 (buster)
[Software] How to install Tesseract OCR in Debian 10 (buster)
tesseract-ocr installation in Debian 10 Buster throws 404 errors.
Tried:
apt install tesseract-ocr
Reading package lists... Done
Building dependency tree
Reading state information... Done
The following packages were automatically installed and are no longer required:
adwaita-icon-theme at-spi2-core avahi-daemon dconf-gsettings-backend dconf-service geoclue-2.0
...
Use 'apt autoremove' to remove them.
The following additional packages will be installed:
liblept5 libopenjp2-7 libtesseract4 tesseract-ocr-eng tesseract-ocr-osd
The following NEW packages will be installed:
liblept5 libopenjp2-7 libtesseract4 tesseract-ocr tesseract-ocr-eng tesseract-ocr-osd
0 upgraded, 6 newly installed, 0 to remove and 293 not upgraded.
Need to get 7 176 kB of archives.
After this operation, 22,4 MB of additional disk space will be used.
Do you want to continue? [Y/n]
Err:1 http://ftp.debian.org/debian buster/main amd64 libopenjp2-7 amd64 2.3.0-2+deb10u1
404 Not Found [IP: 199.232.138.132 80]
Err:2 http://ftp.debian.org/debian buster/main amd64 liblept5 amd64 1.76.0-1
404 Not Found [IP: 199.232.138.132 80]
Get:3 http://ftp.debian.org/debian buster/main amd64 libtesseract4 amd64 4.0.0-2 [1 234 kB]
Get:4 http://ftp.debian.org/debian buster/main amd64 tesseract-ocr-eng all 1:4.00~git30-7274cfa-1 [1 592 kB]
Get:5 http://ftp.debian.org/debian buster/main amd64 tesseract-ocr-osd all 1:4.00~git30-7274cfa-1 [2 991 kB]
Get:6 http://ftp.debian.org/debian buster/main amd64 tesseract-ocr amd64 4.0.0-2 [262 kB]
Fetched 6 078 kB in 1s (9 380 kB/s)
E: Failed to fetch http://ftp.debian.org/debian/pool/main/ ... _amd64.deb 404 Not Found [IP: 199.232.138.132 80]
E: Failed to fetch http://ftp.debian.org/debian/pool/main/ ... _amd64.deb 404 Not Found [IP: 199.232.138.132 80]
E: Unable to fetch some archives, maybe run apt-get update or try with --fix-missing?
Tried apt-get update and
apt install tesseract-ocr --fix-missing
but problem persists. Tried also apt install liblept5 but got same errors.
How to install it ?
Tried:
apt install tesseract-ocr
Reading package lists... Done
Building dependency tree
Reading state information... Done
The following packages were automatically installed and are no longer required:
adwaita-icon-theme at-spi2-core avahi-daemon dconf-gsettings-backend dconf-service geoclue-2.0
...
Use 'apt autoremove' to remove them.
The following additional packages will be installed:
liblept5 libopenjp2-7 libtesseract4 tesseract-ocr-eng tesseract-ocr-osd
The following NEW packages will be installed:
liblept5 libopenjp2-7 libtesseract4 tesseract-ocr tesseract-ocr-eng tesseract-ocr-osd
0 upgraded, 6 newly installed, 0 to remove and 293 not upgraded.
Need to get 7 176 kB of archives.
After this operation, 22,4 MB of additional disk space will be used.
Do you want to continue? [Y/n]
Err:1 http://ftp.debian.org/debian buster/main amd64 libopenjp2-7 amd64 2.3.0-2+deb10u1
404 Not Found [IP: 199.232.138.132 80]
Err:2 http://ftp.debian.org/debian buster/main amd64 liblept5 amd64 1.76.0-1
404 Not Found [IP: 199.232.138.132 80]
Get:3 http://ftp.debian.org/debian buster/main amd64 libtesseract4 amd64 4.0.0-2 [1 234 kB]
Get:4 http://ftp.debian.org/debian buster/main amd64 tesseract-ocr-eng all 1:4.00~git30-7274cfa-1 [1 592 kB]
Get:5 http://ftp.debian.org/debian buster/main amd64 tesseract-ocr-osd all 1:4.00~git30-7274cfa-1 [2 991 kB]
Get:6 http://ftp.debian.org/debian buster/main amd64 tesseract-ocr amd64 4.0.0-2 [262 kB]
Fetched 6 078 kB in 1s (9 380 kB/s)
E: Failed to fetch http://ftp.debian.org/debian/pool/main/ ... _amd64.deb 404 Not Found [IP: 199.232.138.132 80]
E: Failed to fetch http://ftp.debian.org/debian/pool/main/ ... _amd64.deb 404 Not Found [IP: 199.232.138.132 80]
E: Unable to fetch some archives, maybe run apt-get update or try with --fix-missing?
Tried apt-get update and
apt install tesseract-ocr --fix-missing
but problem persists. Tried also apt install liblept5 but got same errors.
How to install it ?
-
- Emeritus
- Posts: 2435
- Joined: 2010-12-07 19:55
- Has thanked: 14 times
- Been thanked: 54 times
Re: [Software] How to install Tesseract OCR in Debian 10 (buster)
0 upgraded, 6 newly installed, 0 to remove and 293 not upgraded.
Code: Select all
sudo apt update
sudo apt full-upgrade
sudo apt install tesseract-ocr
Re: [Software] How to install Tesseract OCR in Debian 10 (buster)
Are you sure that full-upgrade will resolve not found errors?
This is heavily used production server. I'm afraid that
sudo apt full-upgrade
breaks something.
How to install without full upgrade?
I need only two .so libraries which are called by .NET 6 wrapper. Can I copy those files manually or install manually?
This is heavily used production server. I'm afraid that
sudo apt full-upgrade
breaks something.
How to install without full upgrade?
I need only two .so libraries which are called by .NET 6 wrapper. Can I copy those files manually or install manually?
-
- Posts: 1101
- Joined: 2016-01-07 12:25
- Has thanked: 5 times
- Been thanked: 16 times
Re: [Software] How to install Tesseract OCR in Debian 10 (buster)
Just use the update and install lines in the previous reply, it should be able to find the packages.
Re: [Software] How to install Tesseract OCR in Debian 10 (buster)
Thank you.
It worked if I replied y
to three apt update questions . I also added tesseract repositories and its key usings instructions from https://notesalexp.org/tesseract-ocr/html/
Was this nessecary or not? Will it break something? apt update throws some errors:
Also wget throws certificate expired errors. I tried
But certtificate expired errors still occur in wget and --no-check-certificate swith is required. How to fix this?
It worked if I replied y
to three apt update questions . I also added tesseract repositories and its key usings instructions from https://notesalexp.org/tesseract-ocr/html/
Was this nessecary or not? Will it break something? apt update throws some errors:
Code: Select all
apt update
Hit:1 http://security.debian.org buster/updates InRelease
Hit:2 http://ftp.debian.org/debian buster InRelease
Hit:3 http://apt.postgresql.org/pub/repos/apt buster-pgdg InRelease
Ign:4 https://notesalexp.org/tesseract-ocr5/buster buster InRelease
Hit:5 http://ftp.debian.org/debian buster-updates InRelease
Hit:6 https://download.mono-project.com/repo/debian stable-buster InRelease
Err:7 https://notesalexp.org/tesseract-ocr5/buster buster Release
Certificate verification failed: The certificate is NOT trusted. The certificate chain uses expired certificate. Could not handshake: Error in the certificate verification. [IP: 95.142.37.135 443]
Hit:8 https://packages.microsoft.com/debian/10/prod buster InRelease
Reading package lists... Done
E: The repository 'https://notesalexp.org/tesseract-ocr5/buster buster Release' does not have a Release file.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
Code: Select all
update-ca-certificates --fresh
Clearing symlinks in /etc/ssl/certs...
done.
Updating certificates in /etc/ssl/certs...
128 added, 0 removed; done.
Running hooks in /etc/ca-certificates/update.d...
Updating Mono key store
Mono Certificate Store Sync - version 6.8.0.105
Populate Mono certificate store from a concatenated list of certificates.
Copyright 2002, 2003 Motus Technologies. Copyright 2004-2008 Novell. BSD licensed.
Importing into legacy system store:
I already trust 128, your new list has 128
Import process completed.
Importing into BTLS system store:
I already trust 128, your new list has 128
Import process completed.
Done
done.
Re: [Software] How to install Tesseract OCR in Debian 10 (buster)
Ot looks like tesseract 4 was installed:
Application uses https://github.com/Sicos1977/TesseractOCR which uses Tesseract 5.
How to install Tesseract 5? I tried instructions from https://notesalexp.org/tesseract-ocr/html/ but apt still throws untrusted certificate error for this.
Application still throws error
Failed to find library 'libleptonica-1.82.0.dll.so' for platform x64
/usr/lib/x86_64-linux-gnu contains only
liblept.so.5.0.3
How to install libleptonica-1.82.0.dll.so ?
Code: Select all
apt show tesseract-ocr
Package: tesseract-ocr
Version: 4.0.0-2
Priority: optional
Section: graphics
Source: tesseract
Maintainer: Alexander Pozdnyakov <almipo@mail.ru>
Installed-Size: 1 146 kB
Depends: libc6 (>= 2.14), libcairo2 (>= 1.2.4), libfontconfig1 (>= 2.12.6), libgcc1 (>= 1:3.0), libglib2.0-0 (>= 2.12.0), libgomp1 (>= 4.2.1), libicu63 (>= 63.1-1~), liblept5 (>= 1.75.3), libpango-1.0-0 (>= 1.37.2), libpangocairo-1.0-0 (>= 1.22.0), libpangoft2-1.0-0 (>= 1.14.0), libstdc++6 (>= 5.2), libtesseract4 (>= 4.00~), tesseract-ocr-eng (>= 4.00~), tesseract-ocr-osd (>= 4.00~)
Replaces: tesseract-ocr-data
Homepage: https://github.com/tesseract-ocr/
Tag: accessibility::ocr, implemented-in::c++, interface::commandline,
role::program
Download-Size: 262 kB
APT-Manual-Installed: yes
APT-Sources: http://ftp.debian.org/debian buster/main amd64 Packages
Description: Tesseract command line OCR tool
Tesseract is an open source Optical Character Recognition (OCR)
Engine. It can be used directly, or (for programmers) using an API to
extract printed text from images. It supports a wide variety of
languages. This package includes the command line tool.
How to install Tesseract 5? I tried instructions from https://notesalexp.org/tesseract-ocr/html/ but apt still throws untrusted certificate error for this.
Application still throws error
Failed to find library 'libleptonica-1.82.0.dll.so' for platform x64
/usr/lib/x86_64-linux-gnu contains only
liblept.so.5.0.3
How to install libleptonica-1.82.0.dll.so ?
-
- Posts: 1101
- Joined: 2016-01-07 12:25
- Has thanked: 5 times
- Been thanked: 16 times
Re: [Software] How to install Tesseract OCR in Debian 10 (buster)
That version is available in bookworm (testing) so you could try to backport it, but being on oldstable I would imagine it would be an arduous (if possible) task.
Re: [Software] How to install Tesseract OCR in Debian 10 (buster)
Repository
https://notesalexp.org/tesseract-ocr/pa ... ter/amd64/
contains those versions.
How to ignore notesalexp expired certificate errors and installe those packages:
https://notesalexp.org/tesseract-ocr/pa ... ter/amd64/
contains those versions.
How to ignore notesalexp expired certificate errors and installe those packages:
Code: Select all
apt-get update
Hit:1 http://security.debian.org buster/updates InRelease
Hit:2 http://ftp.debian.org/debian buster InRelease
Hit:3 http://apt.postgresql.org/pub/repos/apt buster-pgdg InRelease
Hit:4 http://ftp.debian.org/debian buster-updates InRelease
Hit:5 https://download.mono-project.com/repo/debian stable-buster InRelease
Hit:6 https://packages.microsoft.com/debian/10/prod buster InRelease
Ign:7 https://notesalexp.org/tesseract-ocr-dev/buster buster InRelease
Err:8 https://notesalexp.org/tesseract-ocr-dev/buster buster Release
Certificate verification failed: The certificate is NOT trusted. The certificate chain uses expired certificate. Could not handshake: Error in the certificate verification. [IP: 95.142.37.135 443]
Reading package lists... Done
E: The repository 'https://notesalexp.org/tesseract-ocr-dev/buster buster Release' does not have a Release file.
N: Updating from such a repository can't be done securely, and is therefore disabled by default.
N: See apt-secure(8) manpage for repository creation and user configuration details.
- stevepusser
- Posts: 12930
- Joined: 2009-10-06 05:53
- Has thanked: 41 times
- Been thanked: 71 times
Re: [Software] How to install Tesseract OCR in Debian 10 (buster)
notesalexp is a third-party non-Debian repository.
Don't you think you should have led off with that information, and even had that in the thread title?
You should contact that repo's maintainer for help in fixing its issues. If someone had a problem using my own Debian-compatible Pale Moon OBS repo, I would not expect them to come here first looking for help.
Don't you think you should have led off with that information, and even had that in the thread title?
You should contact that repo's maintainer for help in fixing its issues. If someone had a problem using my own Debian-compatible Pale Moon OBS repo, I would not expect them to come here first looking for help.
Last edited by stevepusser on 2022-10-29 23:51, edited 1 time in total.
MX Linux packager and developer
Re: [Software] How to install Tesseract OCR in Debian 10 (buster)
I installed tesseract 5 from this library disabling certificate verificatrion using
apt -a show tesseract-ocr shows also version 4:
Package: tesseract-ocr
Version: 4.0.0-2
Priority: optional
Section: graphics
Source: tesseract
Maintainer: Alexander Pozdnyakov <almipo@mail.ru>
Installed-Size: 1 146 kB
Depends: libc6 (>= 2.14), libcairo2 (>= 1.2.4), libfontconfig1 (>= 2.12.6), libgcc1 (>= 1:3.0), libglib2.0-0 (>= 2.12.0), libgomp1 (>= 4.2.1), libicu63 (>= 63.1-1~), liblept5 (>= 1.75.3), libpango-1.0-0 (>= 1.37.2), libpangocairo-1.0-0 (>= 1.22.0), libpangoft2-1.0-0 (>= 1.14.0), libstdc++6 (>= 5.2), libtesseract4 (>= 4.00~), tesseract-ocr-eng (>= 4.00~), tesseract-ocr-osd (>= 4.00~)
Replaces: tesseract-ocr-data
Homepage: https://github.com/tesseract-ocr/
Tag: accessibility::ocr, implemented-in::c++, interface::commandline,
role::program
Download-Size: 262 kB
APT-Sources: http://ftp.debian.org/debian buster/main amd64 Packages
Description: Tesseract command line OCR tool
Tesseract is an open source Optical Character Recognition (OCR)
Engine. It can be used directly, or (for programmers) using an API to
...
I tried to remove it using
apt remove tesseract-ocr=4.0.0-2
but it removes version 5.
How to remove 4.0.0-2 version?
apt -a
Code: Select all
apt -o "Acquire::https::Verify-Peer=false" update
apt -o "Acquire::https::Verify-Peer=false" install tesseract-ocr
Is this OK ?apt show tesseract-ocr
Package: tesseract-ocr
Version: 5.2.0-1
Priority: optional
Section: graphics
Source: tesseract
Maintainer: Alexander Pozdnyakov <almipo@mail.ru>
Installed-Size: 2 170 kB
Depends: libarchive13 (>= 3.2.1), libc6 (>= 2.27), libcairo2 (>= 1.2.4), libcurl4 (>= 7.16.2), libfontconfig1 (>= 2.12.6), libfreetype6 (>= 2.2.1), libgcc1 (>= 1:3.0), libglib2.0-0 (>= 2.12.0), libgomp1 (>= 4.2.1), libicu63 (>= 63.1-1~), liblept5 (>= 1.75.3), libpango-1.0-0 (>= 1.37.2), libpangocairo-1.0-0 (>= 1.22.0), libpangoft2-1.0-0 (>= 1.14.0), libstdc++6 (>= 6), libtesseract5 (= 5.2.0-1), tesseract-ocr-eng (>= 4.0.9~), tesseract-ocr-osd (>= 4.0.9~)
Replaces: tesseract-ocr-data
Homepage: https://github.com/tesseract-ocr/
Download-Size: 423 kB
APT-Sources: https://notesalexp.org/tesseract-ocr5/buster buster/main amd64 Packages
N: There is 1 additional record. Please use the '-a' switch to see it
apt -a show tesseract-ocr shows also version 4:
Package: tesseract-ocr
Version: 4.0.0-2
Priority: optional
Section: graphics
Source: tesseract
Maintainer: Alexander Pozdnyakov <almipo@mail.ru>
Installed-Size: 1 146 kB
Depends: libc6 (>= 2.14), libcairo2 (>= 1.2.4), libfontconfig1 (>= 2.12.6), libgcc1 (>= 1:3.0), libglib2.0-0 (>= 2.12.0), libgomp1 (>= 4.2.1), libicu63 (>= 63.1-1~), liblept5 (>= 1.75.3), libpango-1.0-0 (>= 1.37.2), libpangocairo-1.0-0 (>= 1.22.0), libpangoft2-1.0-0 (>= 1.14.0), libstdc++6 (>= 5.2), libtesseract4 (>= 4.00~), tesseract-ocr-eng (>= 4.00~), tesseract-ocr-osd (>= 4.00~)
Replaces: tesseract-ocr-data
Homepage: https://github.com/tesseract-ocr/
Tag: accessibility::ocr, implemented-in::c++, interface::commandline,
role::program
Download-Size: 262 kB
APT-Sources: http://ftp.debian.org/debian buster/main amd64 Packages
Description: Tesseract command line OCR tool
Tesseract is an open source Optical Character Recognition (OCR)
Engine. It can be used directly, or (for programmers) using an API to
...
I tried to remove it using
apt remove tesseract-ocr=4.0.0-2
but it removes version 5.
How to remove 4.0.0-2 version?
apt -a