CoreNLP 工具的使用

凌轶

2023-12-01

首先，进入到 CoreNLP 的下载页

Overview - CoreNLP

这里我们首先下载 CoreNLP 的英文版本，也就是基础版本

https://nlp.stanford.edu/software/stanford-corenlp-latest.zip

解压，放到文件夹中 PATH1 中

然后，返回 CoreNLP 主页，然后下载中文工具包

https://search.maven.org/remotecontent?filepath=edu/stanford/nlp/stanford-corenlp/4.4.0/stanford-corenlp-4.4.0-models-chinese.jar

将中文文件包放至 PATH1 中

进入到 CoreNLP 的文件夹PATH1，然后输入如下命令

java -mx4g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer -serverProperties StanfordCoreNLP-chinese.properties -annotators "tokenize,ssplit,pos,lemma,parse,sentiment" -port 9000 -timeout 30000

然后 CoreNLP 服务器开始监听 9000 端口，并且开始提供服务

想要作分词或者其他分析操作时，建议使用下面的格式

url = "http://127.0.0.1:9000" # 使用本机端口需要先在stanford目录下运行java，见stanford nlp官网
params = {"properties": '{"annotators": "tokenize,ssplit,pos,lemma,ner,parse,depparse,relation"}', "pipelineLanguage": "zh"}
res = requests.post(url, params=params, data=quote(key))

最终的结果是一个词典，我们需要进一步处理

CoreNLP 工具的使用

相关阅读

相关文章

相关问答

相关文档