
Open source free Chinese full-text search engine - XunSearch

Country: China Type: search

Tag: Full text search

Website: http://www.xunsearch.com/

Xunsearch is a full-text search engine solution written in C/C++ on top of Xapian and SCWS. It is suited to full-text search for PHP applications, MySQL data, and site search in general. It supports fast search over very large data sets and is powerful, easy to use, open source, and free.


Xunsearch is a high-performance, full-featured full-text search solution.

Xunsearch aims to help ordinary developers quickly and easily build their own full-text search engine on top of large existing data sets.

The Chinese name of Xunsearch is "迅搜". In code it is usually abbreviated as XS, which is both an abbreviation of the English name and the initials of the Chinese name in pinyin. Here "迅" means fast, in at least two senses: search results come back quickly, and building on top of Xunsearch is easy and quick.

Xunsearch has a layered design consisting of a backend service and a frontend development kit (SDK), with clearly separated layers that do not overlap. The backend is a daemon written in C/C++, while the frontend SDK uses PHP, a very popular scripting language that is convenient for web search projects. For details, please refer to the architecture design.

Xunsearch greatly reduces the difficulty of search development. In addition to the usual Chinese word segmentation, field retrieval, and Boolean query syntax, it offers features that users often need but other free solutions tend to lack, such as related searches, pinyin search, result highlighting, and search suggestions. For the full list, please see our function list.
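To make two of the features named above more concrete, here is a minimal Python sketch of Boolean AND retrieval over an inverted index, plus result highlighting. It is a toy illustration only and is not Xunsearch's implementation or API: the function names (tokenize, build_index, search_and, highlight) and the sample documents are hypothetical, and the tokenizer simply splits on non-word characters, whereas Chinese text needs a real word segmenter such as SCWS.

```python
import re
from collections import defaultdict


def tokenize(text):
    """Lowercase and split on non-word characters.

    Assumption: a stand-in for real word segmentation; it does not
    segment Chinese text.
    """
    return [t for t in re.split(r"\W+", text.lower()) if t]


def build_index(docs):
    """Map each term to the set of document ids that contain it."""
    index = defaultdict(set)
    for doc_id, text in docs.items():
        for term in tokenize(text):
            index[term].add(doc_id)
    return index


def search_and(index, query):
    """Return ids of documents that contain every query term (Boolean AND)."""
    terms = tokenize(query)
    if not terms:
        return set()
    result = set(index.get(terms[0], set()))
    for term in terms[1:]:
        result &= index.get(term, set())
    return result


def highlight(text, query, open_tag="<em>", close_tag="</em>"):
    """Wrap whole-word, case-insensitive matches of query terms in tags."""
    terms = tokenize(query)
    if not terms:
        return text
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(t) for t in terms) + r")\b",
        re.IGNORECASE,
    )
    return pattern.sub(lambda m: open_tag + m.group(0) + close_tag, text)


# Hypothetical sample documents, for illustration only.
docs = {
    1: "Xapian is a search library",
    2: "SCWS handles Chinese word segmentation",
    3: "Full-text search with Xapian and SCWS",
}
index = build_index(docs)

for doc_id in sorted(search_and(index, "xapian search")):
    print(doc_id, highlight(docs[doc_id], "xapian search"))
```

A production engine adds ranking, field-scoped queries, and persistent storage on top of this basic idea; the sketch only shows how a term-to-documents mapping supports Boolean matching and how matched terms can be marked up in the output.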

Xunsearch is fully open source and is released under the GPL, one of the most popular open source licenses. You can obtain all of the project's source code for free, use it freely, and modify and redistribute it under the terms of the license. For details, please refer to the authorization statement file.

1. Application areas

To use Xunsearch you need some development experience in PHP (or in the language of another available SDK) and at least one server running a Unix-like operating system.

Xunsearch can help you build industry portals and vertical search engines, BBS forum search, CMS and website search, document and literature search, and full-text search on top of existing database systems.

2. Historical origins

Around 2006, the founder, Hightman, began exploring full-text search. He first released Chinese word segmentation code implemented in pure PHP, and then MyFT, a Chinese word segmentation patch for MySQL/MyISAM full-text indexes. After a series of improvements, the Chinese word segmentation project SCWS was officially released in 2007 and has been maintained ever since.

The original purpose of developing word segmentation was to make full-text search work for Chinese. Although full-text search technology was already mature, most implementations were commercial products, and free, open source, easy-to-use ones were hard to find. That led to the idea of developing our own full-text search. In 2008 we launched FTPHP, a relatively crude search solution, and in 2010 we started the Xungle project, which removed the technical requirements for users altogether by providing search as a service, including both the software and the hardware environment.

While operating the Xungle project, we found that many users, for various reasons, still had to deploy and run search technology themselves. We also discovered Xapian, an excellent open source search engine library from abroad. It is written in C/C++, performs very well, and has been proven over a long time. However, it does not integrate Chinese word segmentation, and developing with it is relatively difficult and complex. So we decided to integrate Xapian and SCWS and, on that basis, provide more of the commonly used search functions. The result is Xunsearch. A versioned release is currently being prepared, and the code is hosted on GitHub.