 |
docs2text |
TextLib proudly presents a standalone document conversion component/library for direct reading and text extraction from the following formats: MS Word, Excel, PowerPoint, Adobe Acrobat PDF and RTF. Docs2text is divided into several parsers, each responsible for one format. Below is a brief feature list for each component.
doc2text:
- doesn't require MS Word to process documents;
- fastest possible processing speed - up to 200-300 times faster than using MS Word automation;
- precise output - in most cases the output is better than what MS Word's «Save As Text» produces;
- full extraction of tables, numbered and bulleted lists, headers, footers;
- document summary extraction - author, title, keywords etc.;
pdf2text:
- doesn't require Adobe Acrobat to process documents;
- isn't based on xPDF sources;
- fastest processing speed - up to 100 times faster than its competitors, according to our customers' independent research (see the chart below);
- supports multilingual documents, including Asian text (CJK);
- supports rotated pages;
- reads password-protected PDFs;
- advanced text preprocessing, including removal of shadow or duplicated text, restoration of the original document layout, etc.
xls2text:
- doesn't require MS Excel to process documents;
- supports headers/footers;
- supports sheet names;
- supports comments;
- supports full number format;
ppt2text:
- doesn't require MS PowerPoint to process documents;
- supports text object extraction;
- supports notes and comments extraction;
| 2005-11-28 10:52:05 |
| Shareware |
 |
$1080 |
 |
|
|
Version: 2.0
Release date: 2005-11-21
Company: TextLib Software
OS support: Win95, Win98, WinME, Windows 2000, WinXP, Windows 2003
Language: English
Size: 312 K
Category: Development Tools
| |
RTF-to-HTML DLL 1.3 by
SautinSoft
Component for converting RTF and text into HTML and XHTML. The DLL component is absolutely standalone and does not require Microsoft Word or other word processors. Developers may call it from Visual Basic, C#, VB.Net, Delphi, Java etc. Component can... |
| |
UseOffice .Net 2.0.0 by
SautinSoft
UseOffice .Net is a component for developers for converting between RTF, DOC, XLS, PPT, Text and HTML documents.
UseOffice .Net requires MS Office® to be installed on your machine. It works with these Office versions: 2000, XP, 2003 and 2007.... |
| |
ActiveXperts Scripting Toolkit 2.1 by
ActiveXperts Software
ActiveX component to call VBScript functions directly from your source code without invoking WSH, CSCRIPT or WSCRIPT. Use the function result directly. Set function timeout and catch exception errors. The component is thread-safe, so you can use it... |
| |
SSDocument Converter 1.0 by
SautinSoft
SSDocument Converter is a component for converting Microsoft Office 2000 documents into HTML, TXT, RTF and various other formats. Our DLL is a COM object, and developers may call it from Visual Basic, C#, VBA, VB.Net, ASP, ASP.Net, Delphi, Java or other... |
| |
RTF-to-HTML DLL 2.1.1 by
SautinSoft
.Net component to convert RTF to HTML and XHTML in ASP.Net, C#, VB.Net. The DLL is absolutely standalone and doesn't require MS Word or any other word processor. The component can transform RTF to HTML 3.2, HTML 4.01 and XHTML 1.0 with CSS styles.... |
| |
RTF-to-HTML DLL .Net 2.3.0 by
SautinSoft
.Net component to convert Word to HTML and RTF to HTML or XHTML in ASP.Net, C#, VB.Net. The DLL is absolutely standalone and doesn't require MS Word or any other word processor. The component can transform RTF to HTML 3.2, HTML 4.01 and XHTML 1.0 with CSS... |
| |
RTF-2-HTML v5 5.9.7 by
EasyByte Software Ltd
RTF-2-HTML v5 is a COM component that converts RTF to HTML and HTML to RTF perfectly.
RTF-2-HTML is now at its v5 release; it is extremely stable, having been developed over a period of 5 years with feedback from many thousands of customers.
It... |
| |
ASP/Text2PDF 1.00 by
Nonnoi Solutions
ASP/Text2PDF is a server side COM component that allows web developers to automatically convert any text to PDF documents. This component allows you to quickly convert text files, text data, newsletters or reports into a more popular format and most... |
| |
Rich-Text-Editor.NET 1.2.0.0 by
dbAutoTrack Ltd.
RichTextEditor.NET is an easy-to-use, professional WYSIWYG (What You See Is What You Get) content editor for ASP.NET. RichTextEditor.NET provides an intuitive Word®-like editor which can replace any TextBox in your ASP.NET application. Even... |
| |
PDF Stamper ActiveX Component 2.0.2008.118 by
Guangming Software
PDF Stamper ActiveX Component can stamp PDF files with watermark and text easily. It is a standalone component and does not depend on Adobe Acrobat, or even Acrobat Reader.
PDF Stamper can be used to stamp PDF files, you can stamp PDF... |
|
|
|
| |
BFO adds text extraction to PDF Library by
BFO
London, England, 27 October 2005 - BFO (Big Faceless Organization), a global supplier of Java reporting solutions, strengthens the acclaimed Big Faceless PDF Library with the addition of text and image extraction.
The 2.6.2 release adds the...... |
| |
Advanced Word Repair 1.2 has been released! by
DataNumen, Inc.
Advanced Word Repair is a powerful Word document recovery tool. It uses advanced technologies to scan corrupt or damaged Word documents (doc files) and recover as much of your data as possible, so as to minimize the loss from file corruption....... |
|
|
|
|