jPDFText for Linux Icon

jPDFText for Linux

Qoppa Software, LLC (Shareware)
jPDFText is a Java library to extract text from PDF documents
jPDFText for Linux screenshot
Extract text from PDF documents
jPDFText is a Java library to extract text from PDF documents. With jPDFText, PDF documents can be processed to extract the textual content for archiving, storage, searching or indexing. jPDFText is built on top of Qoppas proprietary PDF technology so you do not have to install any third party software or drivers. Since it is written in Java, it allows your application to remain platform independent and run on Windows, Linux, Unix (Solaris, HP UX, IBM AIX), Mac OS X and any other platform that supports the Java runtime environment. Main Features Load PDF documents from files, network drives, URLs or input streams Extract text in the logical reading order Extract words as a vector of Strings Works on Windows, Linux, Unix and Mac OS X (100% Java) No need to install or configure additional drivers or software when deploying Tested on JDK 1.4.2 and above If you require any additional information, dont hesitate to contact us at [email protected] jPDFText can extract existing text content from PDF documents. If you are interesting in recognizing text in scanned PDF documents or PDF documents containing images, you may be interested in our Java OCR feature.
Technical details
jPDFText for Linux 2021R1 from Linux
Basic Windows
OS Support:
Linux, Linux Console, Linux Gnome, Linux GPL, Linux Open Source
Release date:
January 29, 2021
Qoppa Software, LLC (
jPDFText for Linux 2021R1 Changelog

Java 9 Support Rich Text and Non-Latin Unicode Support in Form Fields