Research objectives, research content and key issues
Drawing on natural language processing technology, this project uses the J2EE distributed system architecture to develop an intelligent search engine with natural language processing capabilities.
Intelligent search is achieved in three main parts: semantic understanding, knowledge management and knowledge retrieval. Of these, the knowledge base is the foundation and core of intelligent search. It is the precondition for semantic understanding, and its content is what is ultimately provided to the user. Like the Internet itself, the structure and volume of human knowledge expand quickly, so the knowledge base also needs to be highly adaptable. Within semantic understanding, intelligent word segmentation is the first step: it extracts the core components of the query for use by the semantic analysis module. During segmentation, supplying the analysis program with enough words while filtering out redundant information is an important prerequisite for the quality and speed of the later semantic analysis. Intelligent word segmentation backed by the knowledge base can avoid the combinational ambiguity that arises during splitting, and thus provides good raw material for semantic understanding. Knowledge retrieval then uses the results of semantic analysis to search the knowledge base at the concept level and return the most accurate and most relevant results to the user.
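The word-splitting step described above can be illustrated with a small dictionary-based segmenter. The source does not name a segmentation algorithm, so the sketch below swaps in a common textbook technique, bidirectional maximum matching, purely as an assumption: the text is segmented left to right and right to left against a word list standing in for the knowledge base, and the result with fewer tokens (then fewer single-character leftovers) is kept. The class name `SegmenterSketch`, its methods and the sample dictionary are all hypothetical.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Collections;
import java.util.HashSet;
import java.util.List;
import java.util.Set;

// Hypothetical sketch: bidirectional maximum matching over a word list.
// The word list stands in for the knowledge base; it is not the author's design.
final class SegmenterSketch {
    private final Set<String> dictionary;
    private final int maxWordLength;

    SegmenterSketch(Set<String> dictionary) {
        this.dictionary = dictionary;
        int max = 1;
        for (String word : dictionary) {
            max = Math.max(max, word.length());
        }
        this.maxWordLength = max;
    }

    // Left to right: take the longest dictionary word at each position,
    // falling back to a single character when nothing matches.
    List<String> forward(String text) {
        List<String> tokens = new ArrayList<>();
        int start = 0;
        while (start < text.length()) {
            int end = Math.min(text.length(), start + maxWordLength);
            while (end > start + 1 && !dictionary.contains(text.substring(start, end))) {
                end--;
            }
            tokens.add(text.substring(start, end));
            start = end;
        }
        return tokens;
    }

    // Right to left: take the longest dictionary word ending at each position.
    List<String> backward(String text) {
        List<String> tokens = new ArrayList<>();
        int end = text.length();
        while (end > 0) {
            int start = Math.max(0, end - maxWordLength);
            while (start < end - 1 && !dictionary.contains(text.substring(start, end))) {
                start++;
            }
            tokens.add(text.substring(start, end));
            end = start;
        }
        Collections.reverse(tokens);
        return tokens;
    }

    // Prefer fewer tokens, then fewer single-character leftovers; ties go to backward.
    List<String> segment(String text) {
        List<String> f = forward(text);
        List<String> b = backward(text);
        if (f.size() != b.size()) {
            return f.size() < b.size() ? f : b;
        }
        return singles(f) < singles(b) ? f : b;
    }

    private static int singles(List<String> tokens) {
        int count = 0;
        for (String token : tokens) {
            if (token.length() == 1) {
                count++;
            }
        }
        return count;
    }

    public static void main(String[] args) {
        Set<String> words = new HashSet<>(Arrays.asList("car", "carpet", "petrol", "pump"));
        SegmenterSketch segmenter = new SegmenterSketch(words);
        // Forward matching alone greedily takes "carpet" and strands "r", "o", "l".
        System.out.println(segmenter.forward("carpetrolpump"));
        // The bidirectional choice recovers the intended split: [car, petrol, pump]
        System.out.println(segmenter.segment("carpetrolpump"));
    }
}
```

The unspaced input is only a stand-in for text without explicit word boundaries. A production segmenter would normally also use word frequencies or part-of-speech information from the knowledge base, which this sketch leaves out.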
I am involved in the overall system analysis of the search engine and in the design and development of individual modules, implementing semantic processing of user queries based on natural language processing technology. Compared with traditional search engines, the main features of this search engine are as follows:
Greater ease of use
Because the intelligent search engine has an intelligent word segmentation function, queries are simpler to formulate and easier to carry out.
More accurate search results
Because knowledge (concept) retrieval technology is used, the search scope is made explicit and narrowed, so less useless information is returned.
More intelligent search results
Because the intelligent search engine has a comprehensive knowledge base, its information retrieval and navigation services are more intelligent. The knowledge in the knowledge base helps solve the problem of expression differences, that is, users choosing different words to express the same concept. Synonym definitions in the knowledge base remove the difficulties caused by such differences.
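A minimal sketch of how synonym definitions and concept-level retrieval could work together is given below. It is an illustration under stated assumptions, not the system's actual design: the class `ConceptIndexSketch`, its methods, and the sample concepts and document ids are all hypothetical. Each term is mapped to a canonical concept through a synonym table, documents are indexed by concept rather than by surface word, and a query keeps only the documents that match every recognized concept, which narrows the search scope.

```java
import java.util.Arrays;
import java.util.Collections;
import java.util.HashMap;
import java.util.List;
import java.util.Locale;
import java.util.Map;
import java.util.Set;
import java.util.TreeSet;

// Hypothetical sketch of a concept-level index backed by a synonym table.
final class ConceptIndexSketch {
    private final Map<String, String> termToConcept = new HashMap<>();
    private final Map<String, Set<String>> conceptToDocs = new HashMap<>();

    // Register a concept and the different words users may use for it.
    void defineConcept(String concept, String... synonyms) {
        termToConcept.put(normalize(concept), concept);
        for (String synonym : synonyms) {
            termToConcept.put(normalize(synonym), concept);
        }
    }

    // Index a document under the concepts its terms refer to.
    void indexDocument(String docId, List<String> terms) {
        for (String term : terms) {
            String concept = termToConcept.get(normalize(term));
            if (concept != null) {
                conceptToDocs.computeIfAbsent(concept, k -> new TreeSet<>()).add(docId);
            }
        }
    }

    // Return documents matching every recognized concept in the query.
    // Terms that are unknown to the synonym table are ignored.
    Set<String> search(List<String> queryTerms) {
        Set<String> result = null;
        for (String term : queryTerms) {
            String concept = termToConcept.get(normalize(term));
            if (concept == null) {
                continue;
            }
            Set<String> docs = conceptToDocs.getOrDefault(concept, Collections.<String>emptySet());
            if (result == null) {
                result = new TreeSet<>(docs);
            } else {
                result.retainAll(docs); // each extra concept narrows the scope
            }
        }
        return result == null ? new TreeSet<String>() : result;
    }

    private static String normalize(String term) {
        return term.toLowerCase(Locale.ROOT);
    }

    public static void main(String[] args) {
        ConceptIndexSketch index = new ConceptIndexSketch();
        index.defineConcept("automobile", "car", "auto");
        index.defineConcept("price", "cost");
        index.indexDocument("doc1", Arrays.asList("car", "price"));
        index.indexDocument("doc2", Arrays.asList("automobile", "repair"));
        index.indexDocument("doc3", Arrays.asList("cost", "laptop"));
        // "auto" and "cost" never appear in doc1, yet their concepts do.
        System.out.println(index.search(Arrays.asList("auto", "cost")));
    }
}
```

In this sketch a query for "auto" finds documents written with "car" or "automobile", which is the expression-difference case, and adding "cost" restricts the result to documents that also concern the price concept.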
The key issues to address:
1) How to implement the intelligent word segmentation function.
2) How to apply knowledge (concept) retrieval technology to clarify and narrow the search scope and reduce the retrieval of useless information.
3) How to eliminate the problem of expression differences and provide search results that are as accurate as possible.
3. Features and innovations of this paper or engineering project
The system uses the J2EE architecture with distributed content management and a highly flexible system construction framework, and it can interoperate with existing enterprise applications. It supports multiple file formats and includes a semantic processing engine that understands the content of user queries, identifies user information, and mines latent intent by analyzing the explicit intent and domain information, based on "intrinsic" knowledge representation technology, in order to return the most appropriate results. As the user's questions go deeper, the system strengthens its understanding of the current semantics by analyzing the context.