solr 4.9 suggest.txt词库默认应该是utf-8 无BOM文档
来源:互联网 发布:美剧黑中国 知乎 编辑:程序博客网 时间:2024/06/12 01:49
solr4.9
自动完成配置问题记录
<searchComponent class="solr.SpellCheckComponent" name="suggest"> <lst name="spellchecker"> <str name="name">suggest</str> <str name="classname">org.apache.solr.spelling.suggest.Suggester</str> <str name="lookupImpl">org.apache.solr.spelling.suggest.tst.TSTLookupFactory</str> <!-- Alternatives to lookupImpl: org.apache.solr.spelling.suggest.fst.FSTLookupFactory [finite state automaton] org.apache.solr.spelling.suggest.fst.WFSTLookupFactory [weighted finite state automaton] org.apache.solr.spelling.suggest.jaspell.JaspellLookupFactory [default, jaspell-based] org.apache.solr.spelling.suggest.tst.TSTLookupFactory [ternary trees] --> <str name="field">AllContent</str> <!-- the indexed field to derive suggestions from --> <float name="threshold">0.005</float> <str name="buildOnCommit">true</str> <!-- 使用自定义suggest词库词 --> <str name="sourceLocation">suggest.txt</str> <!-- <str name="spellcheckIndexDir">spellchecker</str> --> </lst> </searchComponent>
这里的suggest.txt词库默认应该是utf-8 无BOM文档,否则solr读取有问题,
如果utf-8有BOM,那么词库第一行无法读取:
solr suggest 自定义词库和系统词库一起使用:
<!-- 可以定义多个,name查询spellchecker --><lst name="spellchecker"> <str name="name">suggestcustom</str> <str name="classname">org.apache.solr.spelling.suggest.Suggester</str> <str name="lookupImpl">org.apache.solr.spelling.suggest.tst.TSTLookupFactory</str> <str name="field">AllContent</str> <float name="threshold">0.005</float> <str name="buildOnCommit">true</str> <!-- 使用自定义suggest词库词 --> <str name="sourceLocation">suggest.txt</str> <!-- <str name="spellcheckIndexDir">spellchecker</str> --> </lst>
查询方式:http://localhost:8080/solr/resource/suggest?q=全&wt=json&indent=true&spellcheck=true&spellcheck.build=true&spellcheck.reload=true&spellcheck.dictionary=suggest&spellcheck.dictionary=suggestcustom
阅读全文
0 0
- solr 4.9 suggest.txt词库默认应该是utf-8 无BOM文档
- UTF-8,无BOM
- UTF-8有BOM与无BOM
- 关于utf-8无bom 编码
- UTF-8(无BOM)和UTF-8区别
- utf-8与utf-8(无BOM)的区别
- utf-8与utf-8(无BOM)的区别
- 【转载】utf-8与utf-8(无BOM)的区别
- utf-8与utf-8无BOM的区别
- UTF-8与UTF-8无BOM的区别
- UTF-8与UTF-8(无bom)的区别
- UTF-8(无BOM)和UTF-8
- 将电脑新建文本文档txt的默认编码从ANSI改为utf-8
- solr学习文档之添加分词词库
- UTF-8带BOM格式与UTF-8无BOM格式转换
- solr suggest
- SciTE4AutoHotkey新建文件默认编码 UTF-8 with BOM
- C#使用StreamWriter生成UTF-8无BOM编码格式
- php-phpMyAdmin 数据库中文乱码问题
- DAY14 PYTHON 数据类型 字符串
- Cookie/Session机制详解
- POJ
- Swift reason: 'attempt to insert row 10 into section 0, but there are only 10 rows in section
- solr 4.9 suggest.txt词库默认应该是utf-8 无BOM文档
- Google Maps基础及实例解析
- DNS Query Code in C with linux sockets
- showModelessDialog新窗口跳转问题
- ORA-00001: unique constraint (CONTRACT_MGR.PK_T_SUPPLIER) violated
- spark的checkpoint
- java笔试题
- 使用ajaxfileupload.js实现ajax异步上传
- 02.java语言基础-引出变量