
SILVERCODERS DocToText

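Document format conversion tool

License: GPLv2

Development language: C/C++

Category: Application tools, document/text editing

Software type: Open source

Region: Unknown

Submitted by: 颜博达

Operating system: Windows

Open source organization: None

Target audience: Unknown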


Overview

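SILVERCODERS DocToText is a document conversion tool that converts many document formats to plain text. It can also extract the annotations and metadata (such as the author) stored in a document and output them as plain text.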

SILVERCODERS DocToText consists of a console application and a C/C++ library, so its text extraction can be embedded in other applications.
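As a rough illustration of using the console application from another program, the sketch below shells out to it from Python. It rests on assumptions that are not stated on this page: that the executable is named `doctotext`, that it is on the `PATH`, that it accepts the document path as its only argument, and that it writes the extracted text to standard output. The helper names `build_command` and `extract_text` are hypothetical.

```python
import shutil
import subprocess
from pathlib import Path


def build_command(document, executable="doctotext"):
    """Build the argument list for the console application.

    Assumption: the tool takes the document path as its only argument.
    """
    return [executable, str(Path(document))]


def extract_text(document, executable="doctotext", timeout=60):
    """Run the console application and return whatever it prints.

    Assumption: the extracted plain text is written to standard output.
    """
    if shutil.which(executable) is None:
        raise FileNotFoundError(f"{executable!r} was not found on PATH")
    result = subprocess.run(
        build_command(document, executable),
        capture_output=True,
        check=True,        # raise CalledProcessError on a non-zero exit code
        timeout=timeout,
    )
    # The output encoding is not documented here, so decode defensively.
    return result.stdout.decode("utf-8", errors="replace")
```

Calling `extract_text("report.doc")` would then return the text of the document as a string, provided the assumptions above hold for the installed build.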

SILVERCODERS DocToText supports MS Office binary formats (MS Word (DOC), MS Excel (XLS, XLSB), MS PowerPoint (PPT)); Rich Text Format (RTF); OpenDocument formats (text documents (ODT), spreadsheets (ODS), presentations (ODP), and graphics (ODG)); Office Open XML formats (MS Word (DOCX), MS Excel (XLSX), MS PowerPoint (PPTX)); iWork formats (PAGES, NUMBERS, KEYNOTE); OpenDocument Flat XML formats (FODP, FODS, FODT); Portable Document Format (PDF); email files (EML); and HyperText Markup Language (HTML).
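The format list above can be turned into a simple lookup table, for example to check a file before handing it to the converter. This is only a sketch: the table `SUPPORTED_FORMATS` and the helper `format_family` are hypothetical names, the check looks at the file extension only, and the `key`, `htm`, and `html` entries are assumed extensions for the KEYNOTE and HTML formats named in the list.

```python
from pathlib import Path

# Extension -> format family, transcribed from the supported-format list.
SUPPORTED_FORMATS = {
    **dict.fromkeys(["doc", "xls", "xlsb", "ppt"], "MS Office binary"),
    "rtf": "Rich Text Format",
    **dict.fromkeys(["odt", "ods", "odp", "odg"], "OpenDocument"),
    **dict.fromkeys(["docx", "xlsx", "pptx"], "Office Open XML"),
    # Assumption: Keynote presentations use the .key extension.
    **dict.fromkeys(["pages", "numbers", "key"], "iWork"),
    **dict.fromkeys(["fodp", "fods", "fodt"], "OpenDocument Flat XML"),
    "pdf": "PDF",
    "eml": "Email",
    # Assumption: HTML files may end in .html or .htm.
    **dict.fromkeys(["html", "htm"], "HTML"),
}


def format_family(path):
    """Return the format family for a file name, or None if it is not listed."""
    extension = Path(path).suffix.lower().lstrip(".")
    return SUPPORTED_FORMATS.get(extension)
```

A lookup by extension says nothing about the actual file contents, so it is best treated as a quick filter rather than a guarantee that conversion will succeed.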

The DocToText console application works as a fast document reader and can also be used for text recovery.

