-
python-tevent-0.9.17-1.fc18.armv6hl
Python bindings for libtevent
Located in
LBN
/
…
/
Access and Identity Management
/
BastionLinux 13
-
python-tevent-0.9.18-1.fc19.armv6hl
Python bindings for libtevent
Located in
LBN
/
…
/
Core Linux
/
BastionLinux 19
-
python-tevent-0.9.19-1.lbn13.x86_64
Python bindings for libtevent
Located in
LBN
/
…
/
Access and Identity Management
/
BastionLinux 13
-
python-tevent-0.9.26-1.lbn19.x86_64
Python bindings for libtevent
Located in
LBN
/
…
/
Access and Identity Management
/
BastionLinux 19
-
python-tevent-0.9.26-1.lbn19.x86_64
Python bindings for libtevent
Located in
LBN
/
…
/
Access and Identity Management
/
BastionLinux 19
-
python-tevent-0.9.26-1.lbn19.x86_64
Python bindings for libtevent
Located in
LBN
/
…
/
Core Linux
/
BastionLinux 19
-
python-textile-2.1.4-3.fc13.noarch
Textile is a XHTML generator using a simple markup developed by Dean
Allen. This is a Python port with support for code validation, itex to
MathML translation, Python code coloring and much more.
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 13
-
python-textile-2.1.5-2.fc19.noarch
Textile is a XHTML generator using a simple markup developed by Dean
Allen. This is a Python port with support for code validation, itex to
MathML translation, Python code coloring and much more.
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19
-
python-textile-2.2.2-5.fc25.noarch
Textile is a XHTML generator using a simple markup developed by Dean
Allen. This is a Python port with support for code validation, itex to
MathML translation, Python code coloring and much more.
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 25
-
python-textract-1.4.0-1.lbn19.noarch
As undesireable as it might be, more often than not there is extremely useful information embedded in Word documents, PowerPoint presentations, PDFs, etc---so-called "dark data"---that would be valuable for further textual analysis and visualization. While :ref:`several packages <supporting>` exist for extracting content from each of these formats on their own, this package provides a single interface for extracting content from any type of file, without any irrelevant markup.
Currently supporting
textract supports a growing list of file types for text extraction. If you don't see your favorite file type here, Please recommend other file types by either mentioning them on the issue tracker or by :ref:`contributing a pull request <contributing>`.
.csv via python builtins
.doc via antiword
.docx via python-docx
.eml via python builtins
.epub via ebooklib
.gif via tesseract-ocr
.jpg and .jpeg via tesseract-ocr
.json via python builtins
.html and .htm via beautifulsoup4
.mp3 via SpeechRecognition and sox
.msg via msg-extractor
.odt via python builtins
.ogg via SpeechRecognition and sox
.pdf via pdftotext (default) or pdfminer
.png via tesseract-ocr
.pptx via python-pptx
.ps via ps2text
.rtf via unrtf
.tiff via tesseract-ocr
.txt via python builtins
.wav via SpeechRecognition
.xlsx via xlrd
.xls via xlrd
Located in
LBN
/
…
/
Plone and Zope
/
BastionLinux 19