How to use PHP to insert RDF in the Web site (2)

zhaozj2021-02-16 84

Fresh meat

Since it is technically, RSS is a well-structured XML document, so it can be handled with standard XML programming technology. There are two main technologies: SAX (The Simple API for XML) and DOM (The Document Object Model).

The SAX analyzer traverses the entire XML document while encountering a specific function when you encounter a tag that does not have the type. For example, call a specific function to process a start tag, call another function to process an end tag, then call a function to process data between the two. The duties of the analyzer are just the order traversing this document. The function it calls is responsible for processing the discovery mark. Once a tag is processed, the analyzer continues to analyze the next element in the document, which is constantly repeating.

On the other hand, the DOM analyzer works to read the entire XML document into memory and convert it into a hierarchical tree structure. Moreover, the API is provided for accessing different tree nodes (and the content attached to the node). The recursive processing method plus the API function allows developers to distinguish between different types of nodes (elements, attributes, character data, annotations, etc.), while performing different operations based on the node type and node depth of the document tree.

SAX and DOM analyzers support every language, including your favorite - php. I will use PHP's SAX analyzer to process RDF examples in this article. Of course, it is also easy to use the DOM analyzer.

Let us look at this simple example and remember it in your mind. Below is an RDF file I will use, this file is directly selected from http://www.freshmeat.net/:

XMLns = "http://purl.org/rss/1.0/"

XMLns: DC = "http://purl.org/dc/elements/1.1/"

Freshmeat.net </ Title> <link> http://freshmeat.net/ </ link> <Description> FreshMeat.Net Maintains The Web's Largest Index of Unix And Cross-Platform Open Source Software. THOUSANDS OF Applications Are Meticulously Catalog in the Freshmeat.Net Database, And Links to New Code Are Added Daily. </ description> <DC: Language> EN-US </ dc: Language> <DC: Subject> Technology </ dc: Subject> <DC: Publisher> Freshmeat.net </ dc: publisher> <dc: creator> Freshmeat.Net Contributors </ dc: creator> <DC: Rights> Copyright (C) 1997-2002 OSDN </ dc: Rights> <DC: DATE> 2002-02-11T10: 20 00: 00 </ dc: Date> <items> <RDF: SEQ> <rdf: li rdf: resource = "http://freshmeat.net/releases/69583/" /> <rdf: li rdf: resource = "http://freshmeat.net/releases/69581/" /> <! - and so -> </ rdf: SEQ> </ items> <image rdf: resource = "http://freshmeat.net/img/fmii-button.gif" /> <textinput rdf: resource = "http://freshmeat.net/search/" /> </ CHANNEL> <image rdf: About = "http://freshmeat.net/img/fmii-button.gif"> <title> Freshmeat.net </ Title> <URL> http://freshmeat.net/img/fmii-button.gif </ url> <link> http://freshmeat.net/ </ link> </ iMAGE> <item rdf: About = "http://freshmeat.net/releases/69583/>"> <title> sloop.splitter 0.2.1 </ title> <link> http://freshmeat.net/releases/69583/ </ link> <Description> A Real Time Sound Effects Program. </ description> <DC: DATE> 2002-02-11T04: 52-06: 00 </ dc: DATE> </ item> <item rdf: About = "http://freshmeat.net/releases/69581/>"> <title> Apacompile 1.9.9 </ title> <link> http://freshmeat.net/releases/69581/ </ link> <Description> a full-featured apache compiration howto. </ description> <DC: DATE> 2002-02-11T04: 52-06: 00 </ dc: DATE> </ item> <! - and so -> </ rdf: rdf> Below is a PHP script that analyzing this document and displays data in it: <? PHP // xml file $ FILE = "FM-Releases.rdf"; // SET UP Some Variables for Use by the Parser $ CurrentTag = "" $ FLAG = ""; // Create Parser $ XP = XML_PARSER_CREATE (); // set Element Handler XML_SET_ELEMENT_HANDLER ($ XP, "ElementBegin", "Elementend"); XML_SET_CHARACTER_DATA_HANDLER ($ XP, "CharacterData"); XML_PARSER_SET_OPTION ($ XP, XML_OPTION_CASE_FOLDING, TRUE); // read XML File IF ($ fp = fopen ($ file, "r")))))))) { DIE ("Could Not Read $ File"); } // Parse Data While ($ XML = FREAD ($ FP, 4096)) { IF (! XML_PARSE ($ XP, $ XML, Feof ($ FP))))) { DIE ("XML Parser Error:". XML_ERROR_STRING (XML_GET_ERROR_CODE ($ XP))); } } // Destroy Parser XML_PARSER_FREE ($ XP); // Opening Tag Handler Function ElementBegin ($ PARSER, $ Name, $ Attributes) { Global $ CURRENTTAG, $ FLAG; // export the name of the current tag to the global scpe $ currenttag = $ name; // if Withn Item Block, Set A Flag IF ($ Name == "Item") { $ FLAG = 1; } } // Closing Tag Handler Function Elementend ($ Parser, $ Name) { Global $ CURRENTTAG, $ FLAG; $ CurrentTag = "" // if Exitation An Item Block, Print A Line and Reset The Flag IF ($ Name == "Item") { ECHO "<hr>"; $ FLAG = 0; } } // Character Data Handler Function CharacterData ($ Parser, $ DATA) { Global $ CURRENTTAG, $ FLAG; // if Withn iTem Block, Print Item Data IF ($ currenttag == "title" || $ currenttag == "link" || $ currentTAG == "Description") && $ flag == 1) { Echo "$ CURRENTTAG: $ DATA "; } } ?> Do not understand? Don't worry, will be explained later. Capture flag This script must first do to set some global variables: // xml file $ FILE = "FM-Releases.rdf"; // SET UP Some Variables for Use by the Parser $ CurrentTag = "" $ FLAG = ""; $ CURRENTTAG Variable Save is the name of the elements of the analyzer. You will soon see why you need it. Because my ultimate goal is to display each individual entry (Item) in the channel and have a link. Also know when the analyzer exits the <channel> </ channel> block, and when I entered the <Item> </ ITEM> section of the document. Besides, I use the SAX analyzer, it works in order, without any analyzer API, can not be used to know the depth and location in the document tree. So, I have to invent a mechanism to do this - this is the reason for introducing a $ FLAG variable. The $ FLAG variable will be used to determine the analyzer in the <channel> block or in the <Item> block. The next step is to initialize the SAX analyzer and start analyzing the RSS document. // Create Parser $ XP = XML_PARSER_CREATE (); // set Element Handler XML_SET_ELEMENT_HANDLER ($ XP, "ElementBegin", "Elementend"); XML_SET_CHARACTER_DATA_HANDLER ($ XP, "CharacterData"); XML_PARSER_SET_OPTION ($ XP, XML_OPTION_CASE_FOLDING, TRUE); // read XML File IF ($ fp = fopen ($ file, "r")))))))) { DIE ("Could Not Read $ File"); } // Parse Data While ($ XML = FREAD ($ FP, 4096)) { IF (! XML_PARSE ($ XP, $ XML, Feof ($ FP))))) { DIE ("XML Parser Error:". XML_ERROR_STRING (XML_GET_ERROR_CODE ($ XP))); } } // Destroy Parser XML_PARSER_FREE ($ XP); This code is simple, and the comments have been explained enough. The XML_PARSER_CREATE () function creates an analyzer instance and assigns it to the handle $ XP. Then create a backup function to process the on-tag and closed mark, and the character data between the two. Finally, the XML_PARSE () function combines the FREAD () call to read the RDF file and analyze it. In the documentation, each time you encounter a bilus, you will be called by ELEMENTBEGIN (). // Opening Tag Handler Function ElementBegin ($ PARSER, $ Name, $ Attributes) { Global $ CURRENTTAG, $ FLAG; // export the name of the current tag to the global scpe $ currenttag = $ name; // if Withn Item Block, Set A Flag IF ($ Name == "Item") { $ FLAG = 1; } } This function takes parameters as the name and attribute of the current tag. The tag name is assigned to the global variable $ CURRENTTAG. If this is called <item>, then the $ FLAG variable is set. Similarly, if you encounter a closed mark, the closed mark processor ELEMENTENTENTEND () will be called. // Closing Tag Handler Function Elementend ($ Parser, $ Name) { Global $ CURRENTTAG, $ FLAG; $ currenttag = ""; // if exitation an item block, print a line and reset the flag IF ($ Name == "Item") { ECHO "<hr>"; $ FLAG = 0; } } The closed tag handler is also used as its parameters with the marker name. If it is a closed mark for </ item>, the value of the variable $ FLAG is reset to 0 and the value of the variable $ CURRENTTAG is empty. So how do you handle character data between tags? This is our interest. Let's greessing the character data processor CharacterData () first. // Character Data Handler Function CharacterData ($ Parser, $ DATA) { Global $ CURRENTTAG, $ FLAG; // if Withn iTem Block, Print Item Data IF ($ currenttag == "title" || $ currenttag == "link" || $ currentTAG == "Description") && $ flag == 1) { Echo "$ CURRENTTAG: $ DATA "; } } Now you can see the parameters passing to this function, you will find that it only receives the number between the tag and the closed mark, and it does not know that the analyzer is currently being processed "tag. And this is the reason why we introduce global variable $ CURRENTTAG at first. If the value of the $ FLAG variable is 1, that is, if the analyzer is currently between the <Item> </ ITME> block, the currently processed element, regardless of <title>, <link> or <description>, The data is printed onto the output device (here, the output device is a web browser) and adds a newline character after the output of each element. The entire RDF document is handled in this order, and a certain output is displayed for each <Item> tag. You can take a look at the results below:</div><div class="text-center mt-3 text-grey"> 转载请注明原文地址:https://www.9cbs.com/read-27960.html</div><div class="plugin d-flex justify-content-center mt-3"></div><hr><div class="row"><div class="col-lg-12 text-muted mt-2"><h2 class="h6 mb-0 small"><a class="text-secondary" href="tag-2.html">9cbs</a></h2></div></div></div></div><div class="card card-postlist border-white shadow"><div class="card-body"><div class="card-title"><div class="d-flex justify-content-between"><div>New Post(0) </div><div></div></div></div><ul class="postlist list-unstyled"> </ul></div></div><div class="d-none threadlist"><input type="checkbox" name="modtid" value="27960" checked /></div></div></div></div></div><footer class="text-muted small bg-dark py-4 mt-3" id="footer"><div class="container"><div class="row"><div class="col">CopyRight © 2020 All Rights Reserved </div><div class="col text-right">Processed: 0.044, SQL: 9</div></div></div></footer><script src="./lang/en-us/lang.js?2.2.0"></script><script src="view/js/jquery.min.js?2.2.0"></script><script src="view/js/popper.min.js?2.2.0"></script><script src="view/js/bootstrap.min.js?2.2.0"></script><script src="view/js/xiuno.js?2.2.0"></script><script src="view/js/bootstrap-plugin.js?2.2.0"></script><script src="view/js/async.min.js?2.2.0"></script><script src="view/js/form.js?2.2.0"></script><script> var debug = DEBUG = 0; var url_rewrite_on = 1; var url_path = './'; var forumarr = {"1":"Tech"}; var fid = 1; var uid = 0; var gid = 0; xn.options.water_image_url = 'view/img/water-small.png'; </script><script src="view/js/wellcms.js?2.2.0"></script><a class="scroll-to-top rounded" href="javascript:void(0);"></a><a class="scroll-to-bottom rounded" href="javascript:void(0);" style="display: inline;"></a></body></html><script> var forum_url = 'list-1.html'; var safe_token = 'dm55CwkGKHgn_2Bd7GPNPloHIP_2FMH2nN_2F_2Bz4O0kEEzbjvXheuerXMDVIpuaI7MUvQuKsAdEzctkCyZ6JpQM5fb_2Bw_3D_3D'; var body = $('body'); body.on('submit', '#form', function() { var jthis = $(this); var jsubmit = jthis.find('#submit'); jthis.reset(); jsubmit.button('loading'); var postdata = jthis.serializeObject(); $.xpost(jthis.attr('action'), postdata, function(code, message) { if(code == 0) { location.reload(); } else { $.alert(message); jsubmit.button('reset'); } }); return false; }); function resize_image() { var jmessagelist = $('div.message'); var first_width = jmessagelist.width(); jmessagelist.each(function() { var jdiv = $(this); var maxwidth = jdiv.attr('isfirst') ? first_width : jdiv.width(); var jmessage_width = Math.min(jdiv.width(), maxwidth); jdiv.find('img, embed, iframe, video').each(function() { var jimg = $(this); var img_width = this.org_width; var img_height = this.org_height; if(!img_width) { var img_width = jimg.attr('width'); var img_height = jimg.attr('height'); this.org_width = img_width; this.org_height = img_height; } if(img_width > jmessage_width) { if(this.tagName == 'IMG') { jimg.width(jmessage_width); jimg.css('height', 'auto'); jimg.css('cursor', 'pointer'); jimg.on('click', function() { }); } else { jimg.width(jmessage_width); var height = (img_height / img_width) * jimg.width(); jimg.height(height); } } }); }); } function resize_table() { $('div.message').each(function() { var jdiv = $(this); jdiv.find('table').addClass('table').wrap('<div class="table-responsive"></div>'); }); } $(function() { resize_image(); resize_table(); $(window).on('resize', resize_image); }); var jmessage = $('#message'); jmessage.on('focus', function() {if(jmessage.t) { clearTimeout(jmessage.t); jmessage.t = null; } jmessage.css('height', '6rem'); }); jmessage.on('blur', function() {jmessage.t = setTimeout(function() { jmessage.css('height', '2.5rem');}, 1000); }); $('#nav li[data-active="fid-1"]').addClass('active'); </script>