-
python-tesseract-ocr-0.0.1-2.lbn19.x86_64
Basic Usage
import tesseract_ocr
tesseract_ocr.text_for_filename('code.tiff')
tesseract_ocr.text_for_bytes(open('code.tiff', 'rb').read())
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19
-
python-textile-2.1.5-2.fc19.noarch
Textile is a XHTML generator using a simple markup developed by Dean
Allen. This is a Python port with support for code validation, itex to
MathML translation, Python code coloring and much more.
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19
-
python-textract-1.4.0-1.lbn19.noarch
As undesireable as it might be, more often than not there is extremely useful information embedded in Word documents, PowerPoint presentations, PDFs, etc---so-called "dark data"---that would be valuable for further textual analysis and visualization. While :ref:`several packages <supporting>` exist for extracting content from each of these formats on their own, this package provides a single interface for extracting content from any type of file, without any irrelevant markup.
Currently supporting
textract supports a growing list of file types for text extraction. If you don't see your favorite file type here, Please recommend other file types by either mentioning them on the issue tracker or by :ref:`contributing a pull request <contributing>`.
.csv via python builtins
.doc via antiword
.docx via python-docx
.eml via python builtins
.epub via ebooklib
.gif via tesseract-ocr
.jpg and .jpeg via tesseract-ocr
.json via python builtins
.html and .htm via beautifulsoup4
.mp3 via SpeechRecognition and sox
.msg via msg-extractor
.odt via python builtins
.ogg via SpeechRecognition and sox
.pdf via pdftotext (default) or pdfminer
.png via tesseract-ocr
.pptx via python-pptx
.ps via ps2text
.rtf via unrtf
.tiff via tesseract-ocr
.txt via python builtins
.wav via SpeechRecognition
.xlsx via xlrd
.xls via xlrd
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19
-
python-tornado-2.2.1-3.fc18.noarch
Tornado is an open source version of the scalable, non-blocking web
server and tools.
The framework is distinct from most mainstream web server frameworks
(and certainly most Python frameworks) because it is non-blocking and
reasonably fast. Because it is non-blocking and uses epoll, it can
handle thousands of simultaneous standing connections, which means it is
ideal for real-time web services.
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19
-
python-tornado-doc-2.2.1-3.fc18.noarch
Tornado is an open source version of the scalable, non-blocking web
server and and tools. This package contains some example applications.
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19
-
python-transaction-2.4.0-1.lbn25.noarch
This package contains a generic transaction implementation for Python. It is
mainly used by the ZODB, though.
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19
-
python-translationstring-1.1-5.lbn19.noarch
A library used by various `Repoze <http://repoze.org>`_ packages for
internationalization (i18n) duties related to translation.
This package provides a *translation string* class, a *translation string
factory* class, translation and pluralization primitives, and a utility that
helps `Chameleon <http://chameleon.repoze.org>`_ templates use translation
facilities of this package. It does not depend on `Babel
<http://babel.edgewall.org>`_, but its translation and pluralization services
are meant to work best when provided with an instance of the
``babel.support.Translations`` class.
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19
-
python-twitter-2.2-1.lbn19.noarch
This library provides a pure python interface for the Twitter API. Twitter
(http://twitter.com) provides a service that allows people to connect via the
web, IM, and SMS. Twitter exposes a web services API
(http://twitter.com/help/api) and this library is intended to make it even
easier for python programmers to use.
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19
-
python-ua-parser-0.3.4-1.lbn19.noarch
Python port of Browserscope's user agent parser
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19
-
python-unidecode-0.04.19-1.lbn25.py37.noarch
This is a python port of Text::Unidecode Perl module. It provides a function,
'unidecode(...)' that takes Unicode data and tries to represent it in ASCII
characters.
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19