Image to Text Converter

This package converts an image to a text file by first converting the image to a PDF, then extracting the text using OCR, and finally converting it to plain text.

This process utilizes Marker package by VikParuchuri.

Installation

pip install img2otxt

Usage

from img2otxt import convert_image_to_text

image_path = 'path/to/your/image.png'
output_dir = 'path/to/output/directory'
convert_image_to_text(image_path, output_dir)

Marker Package

This package relies on the Marker package by VikParuchuri. For more details about Marker, please refer to its GitHub repository.

Testing

python -m unittest discover tests

License

MIT License

Name		Name	Last commit message	Last commit date
Latest commit History 19 Commits
.github/workflows		.github/workflows
__pycache__		__pycache__
dist		dist
img2otxt.egg-info		img2otxt.egg-info
output		output
New.png		New.png
README.md		README.md
__init__.py		__init__.py
environment.yml		environment.yml
img2otxt.py		img2otxt.py
output.pdf		output.pdf
output.txt		output.txt
requirements.txt		requirements.txt
setup.py		setup.py
test2.py		test2.py
test_convert.py		test_convert.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Image to Text Converter

Installation

Usage

Marker Package

Testing

License

About

Releases 1

Packages

Languages

mohammednabarawy/img2otxt

Folders and files

Latest commit

History

Repository files navigation

Image to Text Converter

Installation

Usage

Marker Package

Testing

License

About

Topics

Resources

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages