Dynamic website search engine strategy
Abstract: Dynamic website enriches the function of the website, but for search engines, the situation is different, the dynamic page is generated when the user "input content" or "select", but the search engine "Search Robot" cannot be Enter and Select. In addition, search engines should avoid "Sprider Traps" script errors.
First, the emergence and advantages of dynamic sites
When the earliest Internet appears, the site content is stored on the server in the HTML static page, and the page of the visitor browsing is these actual static pages. With the development of technology, especially database and scripting technology Perl, ASP, PHP, and JSP development, more and more sites begin to take dynamic pages release means. For example, we search for a content on Google.com, the result of the search results page file "itself" does not exist on the Google server, but through the program to call the background database real-time generated when we enter search content, that is, These result pages are dynamic.
The static page site only involves the transfer of files, and the dynamic site is much more complicated, and there is a lot of interactions between users and sites. The website is no longer only the release of content, but becomes an application (Application " It is the expansion of the software industry to the Internet. From the perspective of software, the dynamic site is the separation of logical application layers and data layers. The database is responsible for the storage management of site data, and ASP, PHP, JSP, etc. are responsible for processing site logic. In addition to adding a lot of interaction functions, the benefits are more important for the site's maintenance, updates and upgrades. It can be said that there is no dynamic website technology, and these super large sites on the Internet will not appear.
Second, the search engine will face problems when grabbing the dynamic website page
From the user's point of view, the dynamic website is very good, enriched the function of the site, but for the search engine, the situation is different. (About searching engine and classified catalog, as well as search engines working, please? "Understanding Search Engine" http://www.emarketer.cn/em/expert/45.htm)
The fundamental problem is "Input" and "Select", the dynamic page is generated when the user "input content" or "select", but the search engine "Search Robot" cannot "input" and "select". For example, we must see Feng Yingjian's "Network Marketing Foundation and Practice" in Dangdang Bookstore. The introduction page is dynamically generated. The URL address is: http://www.dangdang.com/product_detail/product_detail.asp? Product_Id = 493698
Here is "?", The product_id parameter value requires us to enter, "Search Robot" can find http://www.dangdang.com/product_detail/product_detail.asp page via the link, but it can't enter "?" Later Product_ID Parameter values, can't grab this page file.
In addition, for the page to reach this "?" Page, the search engine can be grabbed, but in general, the search engine selection does not capture, this is to avoid a kind of "Springs" trap (Spider Traps) "The script error, this error will allow the search robot to discover the infinite loop, and it is wasting time. Third, the search engine strategy of the dynamic website
The dynamic website is to be captured by the search engine. You can use the content publishing system software to convert the dynamic site into a static page. This method is relatively appropriate for the site after the page is released, such as some news categories (such as Sina News: Http://news.sina.com.cn/).
The general dynamic website can capture the search engine by the following approach:
First, we have to let the dynamic page of the URL address "?", Let the dynamic page look like "static page". Take a look at this page http://www.amazon.com/exec/obidos/tg/detail/-/043935806X/, which is obviously a dynamic page, but the URL address looks like "static page". Different dynamic technologies can be implemented by the following technology:
· For dynamic pages with ASP technology, Exception Digital provides a tool called XQASP (http://www.xde.net/products/product_xqasp.htm) to replace "?" To "/".
· For sites that use ColdFusion technology, you need to reconfigure the ColdFusion on the server, use "/" to transfer parameters to the URL, please see the http://coldfusion.com/ site for more detailed information.
· For sites using the Apache server, you can use the REWRITE module to convert the URL addresses with parameters into the form of search engines, which is not default in the Apache server. For detailed information, please see http: // httpd. Apache.org/docs/mod/mod_rewrite.html.
For other dynamic techniques, the corresponding method can also be found in the form of the URL.
Then, you want to create some static pages to point to these dynamic pages (to chase the URL link).
As mentioned earlier, the search engine robot does not "enter" parameters yourself, so let the above dynamic pages are captured by search engines, we also need to tell the address of these pages (that is, parameters). We can create some static pages, generally call the "Gateway Page" in the network marketing, and there is a large number of links to these dynamic pages on these pages.
Submit the addresses of these entry pages to search engines, these pages and links of dynamic pages (changing URL form) can be captured by search engines.
Fourth, search engine support for dynamic website support
While we adjust the dynamic site to adapt to the search engine, the search engine is also developing. So far, most search engines do not support dynamic pages, but Google, Hotbot, etc., Baidu start to try to capture the dynamic website page (including? "" Page), this is now When these search engines are searching, the reason for dynamic links appear in the results.
When these search engines capture the dynamic page, in order to avoid the "Search Robot Trap", only the dynamic pages that are linked from the static page (at least "look" is a static page), and the dynamic page received from the dynamic page link No more crawled. So a dynamic site If you only use these search engines, you can simplify on the basis of the above section: Use only some entry pages, link a lot of dynamic pages, and then submit these entry pages to these search engines.
For direct use of dynamic URL addresses, please note:
· Do not have a session ID in the file URL, do not use ID as a parameter name (especially for Google);
· The less the parameter is, the better, try not to exceed 2;
· Can not use the parameters in the URL to do not use, some parameters are transferred to other places, which can increase the depth and number of dynamic pages captured.