Package | Description |
---|---|
org.dromara.hutool.extra.tokenizer |
中文分词封装
通过定义统一接口,适配第三方分词引擎 |
org.dromara.hutool.extra.tokenizer.engine |
第三方分词引擎实现
|
org.dromara.hutool.extra.tokenizer.engine.analysis |
Lucene-analysis分词抽象封装
项目地址:https://github.com/apache/lucene-solr/tree/master/lucene/analysis |
org.dromara.hutool.extra.tokenizer.engine.ansj |
Ansj分词实现
项目地址:https://github.com/NLPchina/ansj_seg |
org.dromara.hutool.extra.tokenizer.engine.hanlp |
HanLP分词引擎实现
项目地址:https://github.com/hankcs/HanLP |
org.dromara.hutool.extra.tokenizer.engine.ikanalyzer |
IKAnalyzer分词引擎实现
项目地址:https://github.com/yozhao/IKAnalyzer |
org.dromara.hutool.extra.tokenizer.engine.jcseg |
Jcseg分词引擎实现
项目地址:https://gitee.com/lionsoul/jcseg |
org.dromara.hutool.extra.tokenizer.engine.jieba |
Jieba分词引擎实现
项目地址:https://github.com/huaban/jieba-analysis |
org.dromara.hutool.extra.tokenizer.engine.mmseg |
mmseg4j分词引擎实现
项目地址:https://github.com/chenlb/mmseg4j-core |
org.dromara.hutool.extra.tokenizer.engine.mynlp |
MYNLP 中文NLP工具包分词实现
项目地址:https://github.com/mayabot/mynlp/ |
org.dromara.hutool.extra.tokenizer.engine.word |
Word分词引擎实现
项目地址:https://github.com/ysc/word |
Modifier and Type | Method and Description |
---|---|
static TokenizerEngine |
TokenizerUtil.createEngine(String engineName)
创建对应名称的分词引擎对象
|
static TokenizerEngine |
TokenizerUtil.getEngine()
根据用户引入的分词引擎jar,自动创建对应的分词引擎对象
|
Modifier and Type | Method and Description |
---|---|
static TokenizerEngine |
TokenizerEngineFactory.createEngine()
根据用户引入的分词引擎jar,自动创建对应的分词引擎对象
|
static TokenizerEngine |
TokenizerEngineFactory.createEngine(String engineName)
创建自定义引擎
|
static TokenizerEngine |
TokenizerEngineFactory.getEngine()
根据用户引入的模板引擎jar,自动创建对应的分词引擎对象
获得的是单例的TokenizerEngine |
Modifier and Type | Class and Description |
---|---|
class |
AnalysisEngine
Lucene-analysis分词抽象封装
项目地址:https://github.com/apache/lucene-solr/tree/master/lucene/analysis |
class |
SmartcnEngine
Lucene-smartcn分词引擎实现
项目地址:https://github.com/apache/lucene-solr/tree/master/lucene/analysis/smartcn |
Modifier and Type | Class and Description |
---|---|
class |
AnsjEngine
Ansj分词引擎实现
项目地址:https://github.com/NLPchina/ansj_seg |
Modifier and Type | Class and Description |
---|---|
class |
HanLPEngine
HanLP分词引擎实现
项目地址:https://github.com/hankcs/HanLP Segment.seg(String) 方法线程安全 |
Modifier and Type | Class and Description |
---|---|
class |
IKAnalyzerEngine
IKAnalyzer分词引擎实现
项目地址:https://github.com/yozhao/IKAnalyzer IKSegmenter 非线程全,因此每次单独创建对象 |
Modifier and Type | Class and Description |
---|---|
class |
JcsegEngine
Jcseg分词引擎实现
项目地址:https://gitee.com/lionsoul/jcseg ISegment 非线程安全,每次单独创建 |
Modifier and Type | Class and Description |
---|---|
class |
JiebaEngine
Jieba分词引擎实现
项目地址:https://github.com/huaban/jieba-analysis JiebaSegmenter.process(String, SegMode) 线程安全 |
Modifier and Type | Class and Description |
---|---|
class |
MmsegEngine
mmseg4j分词引擎实现
项目地址:https://github.com/chenlb/mmseg4j-core MMSeg 非线程安全,故单独创建之 |
Modifier and Type | Class and Description |
---|---|
class |
MynlpEngine
MYNLP 中文NLP工具包分词实现
项目地址:https://github.com/mayabot/mynlp/ Lexer 线程安全 |
Modifier and Type | Class and Description |
---|---|
class |
WordEngine
Word分词引擎实现
项目地址:https://github.com/ysc/word Segmentation 线程安全 |
Copyright © 2025. All rights reserved.