我将如何像Lucene一样创建自定义搜索索引?

时间:2022-06-25 16:06:21

I implemented a Lucene search solution awhile back, and it got me interested in compressed file indexes that are searchable. At the time I could not find any good information on how exactly you would go about creating a custom search index, so I wonder if anyone can point me in the right direction?

我不久前实现了一个Lucene搜索解决方案,它让我对可搜索的压缩文件索引感兴趣。当时我找不到关于你如何创建自定义搜索索引的任何好信息,所以我想知道是否有人可以指出我正确的方向?

My primary interest is in file formatting, compression, and something similar to the concept of Lucene's documents and fields. It should not necessarily be language specific, but if you can point me to online resources that have language specific implementations with full descriptions of the process then that is okay, too.

我的主要兴趣在于文件格式化,压缩以及类似于Lucene文档和字段概念的东西。它不一定是特定于语言的,但如果您可以指向具有语言特定实现的在线资源以及该过程的完整描述,那么这也是可以的。

2 个解决方案

#1


1  

Managing Gigabytes by Alistair Moffat, Timothy C. Bell

由Alistair Moffat,Timothy C. Bell管理技嘉

Managing Gigabytes cover http://ecx.images-amazon.com/images/I/414SCU9MxqL._BO2,204,203,200_PIsitb-sticker-arrow-click,TopRight,35,-76_AA300_SH20_OU01_.jpg

管理千兆字节包括http://ecx.images-amazon.com/images/I/414SCU9MxqL._BO2,204,203,200_PIsitb-sticker-arrow-click,TopRight,35,-76_AA300_SH20_OU01_.jpg

#2


1  

You may also try to look in the source code of excellent Sphinx search engine.

您也可以尝试查看优秀的Sphinx搜索引擎的源代码。

It is modern full-text open source search engine, and it uses smartly optimized indexes.

它是现代全文开源搜索引擎,它使用智能优化的索引。

#1


1  

Managing Gigabytes by Alistair Moffat, Timothy C. Bell

由Alistair Moffat,Timothy C. Bell管理技嘉

Managing Gigabytes cover http://ecx.images-amazon.com/images/I/414SCU9MxqL._BO2,204,203,200_PIsitb-sticker-arrow-click,TopRight,35,-76_AA300_SH20_OU01_.jpg

管理千兆字节包括http://ecx.images-amazon.com/images/I/414SCU9MxqL._BO2,204,203,200_PIsitb-sticker-arrow-click,TopRight,35,-76_AA300_SH20_OU01_.jpg

#2


1  

You may also try to look in the source code of excellent Sphinx search engine.

您也可以尝试查看优秀的Sphinx搜索引擎的源代码。

It is modern full-text open source search engine, and it uses smartly optimized indexes.

它是现代全文开源搜索引擎,它使用智能优化的索引。