CIO survey: Data mining is not far from CCID network to make data like a human brain, with automatic analysis, judgment and predictive ability, which seems to be incredible, is the function of data mining. Data mining is attracting more and more corporate eye. Recently, the relevant personnel of the Beijing Great Wall Instrument Factory, the National Bureau of Statistics, and the Beijing Bureau of Statistics have expressed concern that data mining is concerned. What is data mining? Which units have applied data mining? How to dig data? How's the effect? What can I learn from? In this issue, the China Geological Survey Bureau, Sinopec Petroleum Exploration and Development Research Institute, Beijing Great Wall Instrument Factory, Chongqing Port Bureau, the National Bureau of Statistics, Hunan Statistics Bureau, Thailand, etc. related personnel and my country database Professor Wang Shan. 20% have been applied, 20% are under construction, 25% is paying attention to data mining from us. According to the survey, 4 units such as the China Geological Survey Bureau, Chongqing Port Bureau, Hunan Statistics Bureau, Nanning, and 20% of 20 investigated enterprises) have been applied to analyze and decision-making support. According to Zhang Yongbo, the person in charge of the Information Center Data Mining Project of China Geological Survey Bureau, in order to find mineral resources, it is necessary to comprehensively handle, analyze and evaluate massive geological information. The traditional approach is to manually evaluate many experts according to their own experience. The manual assessment not only has a long period of period, which is not conducive to discovering that mineral resources, and inevitably bring subjective color, even makes judgment mistakes, which indirectly causes a lot of economic losses. To this end, in the 1980s, the geological industry introduced a computer, and began to explore data mining. Automatic processing and evaluation of massive geological information through data mining, helping people predict which places most likely contain mineral resources. After nearly 20 years of research and development, improvement and application, data mining is currently widely used in geological industries. Unlike the China Geological Survey, Chongqing Port Bureau, Hunan Statistics Bureau and Nanning territory have only started construction of data warehouses in the past two years, and in this basis, the data mining application has been carried out, and it has been initially put into use and effectively assist it. Leaders analyze decisions. In addition, 20% of the Sino-Petrochemical Petroleum Exploration and Development Research Institute, the National Bureau of Statistics, the National Industrial and Commercial Bank, and China Minsheng Bank said that the data mining system is under construction. 25% of the Beijing Great Wall Instrument Factory, the National Bureau of Statistics, and the Beijing Statistics Bureau said that they were concerned that they would like to know what successful cases in China. Other 35% of the interviewed enterprises said that the current information focus is the laying network, improve the office system, application system, etc., not understanding the data mining, and has not considered it. What is data mining? How to achieve data mining? How do it make data automatic analysis, judgment and predictive capabilities like a human brain? According to Professor Wang Shan, director of the vice chairman of the China Computer Society, introduced that data mining is the product of information development to a certain extent, and is a high-level stage of data utilization. With the rapid development of database technology, there are more and more data. Although the current database system can implement data entry, modification, statistics, query, etc., but cannot discover the associations and rules of data in data, and cannot be predicted according to future development trends. How to discover important information behind data, and make higher level analysis to better utilize these data, prompting the appearance of data mining. At present, there are many different definitions in data mining. In short, from data mining is from a large number of incomplete practical application data, the process and knowledge of information and knowledge that people who don't know in advance but may be useful. . There are two sources of data mining, which may be from the data warehouse, or it can be directly from the database. All data need to be selected again, the specific option is related to the task. The so-called data warehouse is not an off-the-shelf product that can be purchased. It is a solution for solving problems.
The data warehouse uses traditional database technology as the basic means of storage data and management resources, with statistical analysis techniques as an effective way to analyze data and extract information, with artificial intelligence technology as a scientific way to dig knowledge and discovery laws. The establishment of the data warehouse is not to replace the original database, but a new application of database technology for supporting decision analysis. It is because the data warehouse integrates rich massive information, which can greatly simplify the data mining process. According to the China Geological Survey, the Chongqing Port Bureau, Hunan Statistics Bureau, Nanning Local Tax and other data mining is based on data warehouse. Implemented. "Let the data have automatic analysis, judgment, and prediction as a human brain is to establish an analysis model", Wang Shan said: "Modeling is to abstract your professional experience, general rules or universal cases into an analysis model. Once the model After you build, you can apply it to those situations, and the result is unknown. "For example, suppose you are a telecommunications company's marketing supervisor, the company wants to develop some new long-distance telephone users. Based on your own experience, when you are looking for who is the most potent new customer, you can first understand which people spend more old customers spending more time on the long-distance call. Because you have a lot of information about old customers, such as age, gender, credit records, and long-distance calls. This is equivalent to you also have the same information on many potential customers. Through statistical analysis of the age, gender, credit records of these old customers, you can infer the most potential new customers. This is much more effective than blindly selling. Modeling is to build a model in the data warehouse, abstract several variables from the specific application. For example, a simplified model of long-distance telephone users can be expressed by customers' occupations, position, annual salary, long calls, gender, and regional variables per month. According to this model, the system can try to dig out the age, gender information of the potential new customer from the large number of call records of the old customers, and help you find new long-distance telephone customers. In fact, the data mining system can be resistant, and ultimately, it is still designed and commanded. The process of excavating data is the process of processing, analyzing, predicting data in accordance with the "model" designed, which is human experience and analysis process is implemented in the computer. Good effect, standard, modeling is the key to talk about the application effect and construction experience of data mining, the relevant person in charge of the China Geological Survey Bureau, Hunan Statistics Bureau, Chongqing Port Bureau and Nanning Taxation Bureau, unanimously believes: the effect is good; if necessary, it is also Future development trend. But the implementation is not easy, the system is still waiting to be improved. Summary experience, they believe that the first data plan must have a unified standard; secondly, modeling is important. According to Zhang Yongbo, the person in charge of the information center data mining project of the China Geological Survey Bureau, from the perspective, first, the analysis efficiency of data mining is much higher than the human evaluation; secondly, data mining can also do the original artificial work, Such as superimposed processing. There are many kinds of geological data. There are dozens of geological data on any space point. Different experts have different evaluations. How to put dozens of data stacks form a comprehensive evaluation, relying on traditional manual operation, not at all It may be realized, and the data mining can be. Therefore, data mining is relatively efficient and complete than manual operation of mineral resources. At the same time, he also believes that it is very difficult to achieve, the hardest is modeling, because it is a process of continuous repeated, continuous improvement. How to make experts experience, thinking, not only need to use professional knowledge, but also use a multidisciplinary theory such as neural network, probability statistics, fuzzy mathematics. In this regard, Xiao Shengli, deputy director of the Data Warehouse Office of Hunan Statistics, also deeply. He believes that modeling is a process of common involvement with a user and developer, usually requiring users to have an expert theoretical level, otherwise, I may not know how to use it.