Report on the literature report of the literature "XML database in mobile system"

xiaoxiao2021-03-06 208

The status of database technology in information management is self-evident, it has become an important part of advanced information technology, which is the foundation and core of modern computer information systems and computer application systems. The database technology initially produced in the mid-1960s, according to the development of the data model, can be divided into several phases: from the first generation of network database system, to the hierarchical database system, to the relational database system, to contemporary object-oriented The model is a database system for the main feature.

Contemporary database, [1] despite the rise of Internet applications, the large number of XML data has occurred, but the status of the relational database is still mainstream, and the database is more widely applying, and the combination of multidisciplinary technology New database technology is an endless, such as object-oriented and object-relational database systems, mobile database systems, real-time database systems, XML, and semi-structured database systems, parallel and distributed database systems, multimedia databases, and more. This report will explain the content related to the mobile database and XML database.

I. Move the origin of the database

The society has entered the information age, and people's lifestyle have also taken great changes. Modern technology has provided convenient tools for people's exchanges and communication. The times require people to access information anytime, anywhere, and have a service, achieving unconstrained free communication and sharing resources. Ideal, this is a more flexible, complex distributed computing environment, which is called mobile computing. The mobile computing system is different from the traditional distribution computing system, which is a distributed computing system composed of fixed nodes and moving nodes, with mobility, frequent disconnectability, network condition diversity, network communication asymmetry, and high system. Features of scalability and low reliability and finite power supply. These features make traditional distributed database technologies that cannot support or effectively support mobile computing environments. Therefore, it is necessary to improve the existing traditional distributed databases, or redesigned, forming a database technology that fully supports mobile computing environments, that is, mobile databases (Mobile Database). [5]

Second. Mobile database typical system model mobile database system consists of three types of nodes, namely:

(1) Server (SVR): Generally, a fixed node, each server maintains a local database, and the server is connected by a reliable high-speed internet network to constitute a distributed database system in a traditional sense. The server can handle client online requests and maintain a history of all requested.

(2) Mobile Support Station (MSS): MSS is also located in the high-speed network and has wireless networking capabilities, which are used to support a wireless network unit (Cell), which can be Communication with an MSS via a wireless link, thereby communicating with the entire fixed network, and broadcast information transmitted by the MSS can also be received. Server and MSS can be the same machine.

(3) Mobile Client (MOBIT CLIENT, MC): The processing power of MC is very limited and storage capacity relative to the server, and can be disconnected in any wireless unit), often disconnected from the server. (Refers to MC to communicate with the server). Even when it is connected to the server, the network bandwidth between the MC and the server differs significantly between the network environment in the MC, and the network is relatively reliable, and the network delay is large.

III. Development status and development trend of mobile database technology

1. Key technologies for mobile databases

The theory and technology involved in mobile databases covers the latest achievements in today's communication and computer development. Among them, how to perform data management in mobile environments is the key to implementing mobile databases. According to current research institutions, these key technologies are mainly concentrated. In the following aspects: copying and cache technology, data broadcasting technology, mobile query optimization and transaction technology, mobile database security technology. [5] 2. Embedded mobile database, an integral part of the mobile database system

Turning out a lot of information, it is found that many authors will mix embedded mobile databases and mobile databases as a talk, even embedded mobile databases are referred to as mobile databases. I found that both of these are related and different.

I think the mobile database is a more abstract concept. We can understand that the database involved in the mobile environment can be called a mobile database. The concept of embedded mobile database I think is a concept that is generated in a mobile technology application environment. Due to the development of mobile databases for different applications, the database management model is different, and some database models use distributed features, and some use intelligent agents, and some use B / A / S multi-layer structure. and many more. The current mobile database is more typical application mode is the three-level database application mode. The following two drawings give a three-level mobile database system structure diagram for a relatively typical application, [12] The other clearly indicates that the embedded mobile database system section is embedded in the database three-level application mode. [4] It can be seen that people will be embedded in the database part of the mobile device as an embedded mobile database, which is just an integral part of the entire mobile database system.

(1) Database Server DBSVR (Database Server): It can be a large database system, such as: o Racle, Sybase, DB2, SQL Server, typically a fixed node. Maintain a complete copy of the local database on each DBSVR, and the server is connected by a reliable high-speed Internet to constitute a traditional distribution database system.

(2) Move support Node MSS (Mob Ile Suppn RT Stat ION): Data exchange between distributed transaction, control EMDB and DBSVR and supports a wireless unit (Cell), with wireless networking capabilities and synchronization functions, MSS Also located in the high speed network. The server can be with MSS as the same machine.

(3) Location server L s (Locat Ion Server).

(4) Trusted part: consist of fixed networks and fixed hosts on the network, fixed hosts are divided into two categories: one type is host, such as DSSVR and LS without wireless communication; the other is a wireless communication interface MSS .

(5) Mobile client MC (Mob Ile Clien T): The processing capability of MC is very limited and storage capacity relative to the server, and can be mobilized (ie, in any of the wireless units), which is stored above it. The database copy, and the local data is managed by EMDB, and the EMDB can be exchanged with DBSVR through the wireless link with one M SS, and EMDB can be exchanged with DBSVR through the ODBC interface on M SS. Thereby communicating with the entire fixed network, and broadcast information sent by the MSS can also be accepted. It often disconnects with the server (referring to MC unable to communicate with any online communication) even when it is connected to the server, due to the multi-change between the network environment in the MC, the network bandwidth between the MC and the server differs significantly, and reliability Lower, the network is delayed.

(6) Local database rep (Rep Licat ION): Database copy.

(7) EMDB: Embedded mobile database, moderation mode is: Whenever the MC issues Q 1, it first queries the local database (mobile subset), if the query condition requires, return to MC, otherwise Submit the query request to the VS, by the VS instead of the query and return the result to the MC (provided that the two are in connection). If the MC is in a disconnect state, the local query can only be performed on the MC. If the data on the MC is updated during the disconnection period, the data will be integrated when the VS is again connected, and the data is integrated, and the consistency is guaranteed by consistency maintenance algorithm. [12]

The main EMDB application mode is based on embedded devices for clients, running a fine EMDB on embedded devices, connecting to an enterprise-class database through synchronization or replication technology. This application mode is supported while supporting embedded devices, and mobile devices, even wireless mobile devices, constitutes applications based on mobile environments. In the database three-level application mode, the EMDB system generally uses EMDB synchronous / replication server enterprise-level DB, as shown in Figure 2. 3. Development Status of Mobile Database Technology: From Research Trend Application

Mobile database techniques also have places to be further studied, but with market demand, and many achievements made in technology research, mobile database technology has developed from the extensive application area. Various embedded mobile database products have emerged. Especially for the continuous improvement of mobile data processing and management requirements, the embedded mobile database technologies closely combined with various smart devices have been obtained from academia, industry, military, military, civilian departments, and continuous practical use. [7]

4. Features and key technologies in embedded mobile database applications

From the data management mode of the above-mentioned embedded mobile database, you can see that the embedded moving database is an end in a mobile device, and the other end is a distributed database system for a fixed synchronization / replication database. The database of mobile devices requires data processing query transactions on mobile terminal devices, on the one hand, and the data consistency problem of the database, and the synchronous / replication database is interactive with mobile terminals, and Enterprise DB interacts. The embedded mobile database management system is applied to the embedded operating system in the environments where the mobile computing environment is applied. It is generally integrated with the resource limit of the mobile device, which exists as the front end of the application system, and management The data set may be a copy of the subset or subset of the dataset in the backend server, so it has its own characteristics and key technologies that must be solved: [7] [18]

l T Mini kernel structure: Taking into account the limited resource of embedded devices, EMDBMS should be implemented in miniaturization technology, and the system structure is tightened to meet the needs of embedded applications under the premise of satisfying the application.

l Transaction: EMDBMS should have transactional functions, automatic maintenance of the integrity, atomic characteristics; support entity integrity and reference integrity, but transaction processing should be as simplified as possible, and may need to combine throughout the application The feature of the mobile computing environment performs transaction control.

l Backup recovery: The backup and recovery of the embedded database is different from the large DBMS management database, and it is not easy to perform separate services or similar forms, and is done in a simplified manner. EMDBMS should have automatic recovery capabilities that do not require manual intervention to perform embedded database management and provide data backup and recovery to ensure safe and reliable user data.

l Perfect data synchronization mechanism: Data synchronization is the most important feature of embedded databases. By data replication, the variation of the embedded database or the primary database can be applied to the other party to ensure the consistency of the data. A significant feature of the mobile database is a weak connection between the mobile terminal and between the server, that is, low bandwidth, long delay, unstable, and regular disconnection. In order to support the user's operation of the database in a weak environment, the optimistic replication or lazy replication is generally used to allow the user to operate a copy of the data on the local cache. After the network is reconnected, then exchange data modification information with the database server or other terminals, and recover the data consistency by conflict detection and coordination.

l Support multiple connection protocols: EMDBMS should support a variety of communication connection protocols. Connections to the embedded device and the database server can be implemented by serial communication, TCP / IP, infrared transmission, Bluetooth.

l Support multiple embedded operating systems: Embedded mobile DBMS should support Windows CE, Palm OS, etc., such as currently popular embedded operating systems, so that the embedded mobile database management system is not limited by mobile terminals. l Security: Many embedded devices in many applications are critical devices of data management or processing in the system, so the database system on the embedded device is more stringent. At the same time, many embedded devices have high mobility, portability, and non-fixed working environments, and there is also a potential unsafe factor. At the same time, some data has a high privacy, so sufficient security guarantees are required to prevent collision, magnetic field interference, loss, theft, etc. The main measures to ensure data security are: First, the mobile terminal is authenticated to prevent the deceptive access of the illegal terminal; second, the wireless communication is encrypted, preventing data information leakage; third, add a copy of the downloaded data copy To prevent the transition of the physical loss of the mobile terminal.

l System Quick Start: The system reliability and availability of embedded / mobile devices are generally relatively low relative to the fixed host, so the probability of the system failure may be greatly improved. Therefore, in such a computing environment or computing platform, the system must ensure that the system can function through the system through the hardware in the event that cannot be software corrects.

l Configuration and use of EMDBMS and application, it is necessary to provide an interface supporting application development.

l Support Java technology: At present, there are Java-based development applications on many smartphones, you should consider Java or similar support.

l Effective System Process Optimization: In the case of hardware, EDBMS must implement certain query optimization techniques, such as using simple indexing.

l In addition, if a mobile device embedded in the system supports real-time applications, the embedded database system also considers real-time processing requirements. This is because the mobility of the device, if the application request is too long, the task may be invalid after the execution is complete, or the validity is greatly reduced. Therefore, the timeliness and correctness of the processing are equally important.

L. An ideal state is that the user can perform data operation and management of all mobile databases related to him only with one mobile terminal (such as a mobile phone), which requires the front-end system to have versatility, and require mobile databases The interface has a uniform, standard standard. The front-end management system automatically generates a unified transaction command when performing data processing, and submits the currently connected data server execution. This effectively enhances the versatility of the mobile database and expands the application prospect of the embedded mobile database.

5. Development trend of embedded mobile database

From the current development trend of embedded applications, embedded mobile database technology will make database technologies more customized (customizable) and flat civilization, namely: system selection technology route to face specific industrial applications, can't go "big And full "universal route. [7]

Four. XML and XML Database

XML

XML is called Extensible Markup Language. It is a new descriptive marker language that simplifies and improves from standard universal markup language SGML (Standard Generalized Markup Language), which is published by the W3C organization. The original intention of W3C organizations to develop XML standards is to define standards for exchange data on the Internet. Since the XML language itself is simple, open, scalable, flexible, self-description, etc. Today, XML has occupied the academic field of the database and in the business application field.

2. Several important features of XML

Scalability. XML is a meta-dimensional language of the design tag language, which allows users to develop a special tag in the field according to their own needs, even allowing all enterprises, and unique needs according to their own areas to create a special mark to create in this field. The basis of the information sharing and exchange of information. For example, the Mathml described in mathematical expressions, BSML, which is used for biological information, and the like. Self-descriptive. The XML document is self-description, not only people can read the XML document, but also the computer can handle. The data in the XML document can be extracted, analyzed, processed, analyzed, and displays with the desired format. XML indicates that the data is truly independent of the application system, and these data can be reused, so XML is suitable for open information management. Because of its self-descriptive, the data in the document can be created, queried, and updated by XML (XML-Aware), followed by the traditional relational database, similar to the data in the object database. XML can also be used to indicate data that is too complicated and difficult to process for those who have not been considered as documents before they are not seen. So, the XML document is considered as a documentation of the documentation and the documentation of data.

3. XML database

The reason why XML can be called a database, we can know from the famous "XML and Database" in Ronald Bourret. The XML document itself is a collection of data, and in a certain extent, XML and associated peripheral techniques can be said to form a database management system, which includes four aspects, one is data storage, XML document is quite A data area of the XML database, an XML document is a basic storage unit, which is equivalent to a table in the relational database; the second is the mode, DTD, or XML Schema, etc. is a description of the logical model of the XML database; third is query language , XQuery, XPath, XQL, XML-QL, Quilt, etc. can act as query language of the XML database; finally the programming interface, two programming interfaces through SAX and DOM, can implement many management functions of the XML database. Nowadays, the XML database has been technically long progress. It has continuously added traditional database technology such as query optimization, transaction processing, trigger, concurrent control, algebraic system, etc. from the initial simple query engine. Perfect yourself. [36]

4. Classification of XML Database Products

For XML Database Product Categories, Ronald Bourret is in his "XML Database Products" [35], divides XML database products into middleware (XML-enabled Databases), native XML database (Native XML) Databases, XML Server (XML Servers, Wrappers, Six of the Content Management Systems), have a great influence in the industry. However, in the eyes of XML database research and developers, most comparison III three types of categories: native XML database, support XML database, mix XML database, or two categories: native XML database, support XML database. Here I mainly introduce three database products that may be involved in the paper: native XML database, support XML database, XML server.

What is the native XML database? It is specifically designed to store a database of XML documents, which stores XML documents themselves, supports transaction, security, multi-user access, programming API and query language, etc., its internal model is based on XML document format. .

Supporting XML databases is a traditional relationship and an object-oriented database extension. On the basis of traditional databases, an XML mapping layer is added by a database vendor or a third party, managing XML data stored by this mapping layer, and implements traditional database and XML. The conversion between the documents is particularly suitable for data-centric applications. As for the XML server? We know that traditional web servers are based on information transmission based on HTML text. With the emergence of XML technology, it is generated for XML-based web servers. So what is the XML server? Accurate definition of this concept of XML Server is difficult, because this is really a relatively new, and the concept is very broad, although there are already many products called yourself for XML Server, such as Datachannel's DataChannel Server 4.1; Software AG Tamino Excelon's Excelon, but on the scope of the application, each product is different, so this is not defined by XML Server, but summarizes some of these products. The way description to explain the concept of XML Server. Simply put, XML Server is a platform for providing data, which can interact with distributed applications in the form of an XML document. For example, the application of e-commerce. This is very similar to the traditional database, which provides the storage and extraction function of the data as the database, but the format of the data is based on XML, so it is completely different from the traditional database. Technically, XML servers typically include a complete application development environment and make applications easily access and use this data through a variety of data storage methods. Stored data includes traditional database data, email information, and file systems, and more.

5. Technical issues related to XML

Document type

We believe there are two types of XML documents, data-centric documents (Document-Centric Documents) and documentation (Document-Centric Documents), distinguish between their respective features will affect us How to select the storage method of XML in the database.

Data-centric documents: Data-centric documents have a very rule result, such as the XML documentation on the sales order or the restaurant menu. The data-centric document is usually designed for the machine, that is, it is mainly convenient for the machine to process. Typically, any Web site can dynamically build an HTML document, as follows, find the related XML document, then transform the XML document via XSL, allowing HTML-based browsers to easily browse results .

Document-centric documents centered: The documentation centered on the document has an irregular structure, and the particle size of the data is also large. Specific examples such as books, emails, advertisements, and more. The documentation-centric documentation is mainly designed with humans.

In reality, the difference between data-centered and document-centered documents is not necessarily significant. For example, another document-centric document, such as invoices, may contain large particle size, structural irregular data, such as part description; another document, document, file, such as user manual, may contain fine-grained structural rules Data (usually metadata) such as the author and revision date. Other examples include legal and medical instruments, although written in loose form but contain discrete data blocks such as dates, names, and operation procedures, for regulations, for regulations, it is usually stored in a complete file form.

2. Document storage

The advantage of using XML format is to express the structure and content of data well, regardless of whether the XML document is data-centered or in document, whether data-centered is structured data or semi-structural data. In the end, we all need to consider how to save the data of XML expression well, of course, this involves the problem of document storage. Experts put forward some suggestions in this regard, can be described as follows: We typically store data in traditional databases, such as relationships, object-oriented or hierarchical databases. This can be completed by the third party's middleware or by the database itself provides internal support (ie, support XML databases) to implement conversion and storage of data formats. However, for semi-structured data, if it mapped to the relational database, the result is that a large number of null values (NULL), or the number of tables is too large, and the waste space or low is low. Although semi-structured data can be stored in an object-oriented or hierarchical database, or in the blob of the relational database, it is also possible to select it in the form of an XML file in the form of a native XML database.

With documentation, it can be stored in a native XML database (a database designed for storage XML) or a content management system (built on the original XML database). However, it can also be stored in a database that supports XML, and it is usually not required, but the XML document is written in the table of the relational database in the form of BLOB.

3. Conversion between documents and databases

In the above mentioned, the XML document is stored in the traditional database, or the data is removed from the database, and the result file is converted into the XML document required by the application, and uses the XML document as the data intermediary between the database, At this time, the conversion software between the XML document and the database is completed by the map between the XML document and the database. This map is divided into two types: template driver and model drive.

Template-driven mapping: there is no pre-mapping between prior documents and databases, but embedded in the template of the data conversion software processing, and the general data transmission middleware is processed. For example, consider the following template:

The Following Flights Have Available Seats:

Select Airline, FltNumber, Depart, Arrive from Flights

We Hope One of these Meets Your Needs

Note that a SELECT statement is embedded. When processing the middleware with the data transfer, each SELECT statement will be replaced by its results, and the format format formatted in XML is:

The Following Flights Have Available Seats:

ACME

123

DEC 12, 1998 13:43

DEC 13, 1998 01:21

...

We Hope One of these Meets Your Needs.

Template-driven mapping can be quite flexible, for example, some products allow you to put the results to any location of the XML document, and you can set parameters to the SELECT statement, and you can use the for loop statement and the IF condition statement. It is worth noting that the current template-driven mapping can only be applied to pass data between relational databases and XML documents.

Model-driven mapping: refers to the data in the XML document to map data into the database according to pre-defined models, clearly or implicitly. In an XML document, these two models are very common: Table Model and Data-Specific Object Model.

The table model is a table-based mapping, and many middleware packages are passed between the XML document and the relational database. It expresses the XML document as a single table or a collection of tables. In this way, the structure of an XML document can be represented in the following form:

...

Here, the keyword "talbe" indicates a single result set when passing the data from the database to the XML document, when passing the data from an XML document to the database, indicates a single table or view. However, when the result set is not only one, or when the XML document includes multiple complex nested, this conversion cannot be adapted, and this conversion cannot retain the physical structure of the document (such as characters and entities), CDATA Part or character encoding), document information (such as document type or DTD), annotation information, and processing instructions, etc.

The dedicated data object model is also an object--relational mapping, which is used to support XML relational databases and some middleware products that convert data between XML documents and relational databases. In this model, the data in the XML document is used as an object tree, and the element type, element content, or mixed content (complex data type) with attributes is modeled as a class, and the element type of PCDATA content (simple data) Types), attributes, and PCDATA modeling as a hierarchical property, then map this model to a relational database using a traditional object-relational mapping technology or SQL3 view, where the class is mapped into a table, the hierarchical property maps the field in the table Columns, the attributes of the object value are mapped to primary key / foreign key pairs. There is a significant correspondence between XML documents, objects, and database tables, as shown below:

XML document element a object a

Object a {

datab b = "datab"

datac c = "datac"

DATAD d = "datad" }

In fact, when data is transformed between XML and databases, it is necessary to consider two processes: one is generating DTD from database mode, and the other is based on DTD generation database mode.

The steps to generate a relationship mode from a DTD:

l For each element, generate a table and a primary key column.

l For each element with mixed content, generate a separate table to store PCDATA and associate through the primary key and parent table of the parent table.

l The properties of each of the single value in the element type, the sub-elements having only PCDATA content (the child element appears in order), generate a separate column, if the child element type or value is selected, the column is It should be allowed to be NULL types.

l For the attributes with multiple values and multiple sub-elements (this sub-elements PCDATA), you need to create a separate table to store these values and connect them through the primary key and parent table of the parent table.

l For each sub-element containing an element or mixed content, the parent element is coupled to the sub-element through the primary key of the parent table.

Build a DTD step from a relational database mode as follows:

l Create an element for each table.

l Table each column in the table creates an attribute or a child element with only PCDATA content.

l Create a child element of the table element based on each of the main keys / foreign key relationships in the table.

Query language

Since the research and development of XML technology in 1995, the shape of XML inquiry language has been continuously introduced. Compared to early XML-QL, XQL, UNQL, later quilt, XPath, and XQuery developed by Quilt. With the strong support of W3C and academia, XQuery gradually stood out in these inquiry languages, and XQuery has become a factual industrial standard. XQuery's FLWR statement specification, has a fully similar expression of SQL with relational databases, making it friendliness in the eyes of the general user. The XPath can be understood to be a subset of XQuery. The XPath expression is proved to be equivalent to the query mode tree in the relevant literature, which is also consistent with the mode tree query of the academic community, so that the laboratory system can handle the XPath query expression without difficulty, and can inquire Optimization, this is quite valuable in XML database research.

Document analysis

For any processing above the XML document, such as the conversion is loaded into the database, or a query, it will pass an XML data parser. The data parser performs the corresponding application after analyzing the XML data in accordance with certain rules. The current data parser generally provides two ways of SAX (Simple API for XML) and DOM (Document Object Model). SAX and DOM are two different application programming interface APIs for XML documents. The current XML database products support these two parsing methods.

The DOM parser is made by W3C. Its parsing process is to read the entire XML document, and then build a tree structure with a hierarchy of the memory. The application can operate this tree structure with a set of interfaces provided, and the DOM is considered to be treated or based on objects. Analysis. The biggest problem is that the DOM documentation obtained after parsing is very large. The ratio of the DOM document and the XML document is easy to exceed 10 (the actual coefficient depends on the average length of the text in the file, the average length of the text is more High), obviously this is a large XML document that uses DOM parsing as required to have great memory, which is limited by memory capacity, and the DOM parser must read the entire document before the code is running, for very Big documents, if the user only pays attention to the small part of the document, creates the object that is never used is extremely wasteful, and the big document will also cause significant delay. In order to solve the DOM problem, the XML-DEV mailing list member creates a SAX interface. SAX is based on event-based resolution. When the SAX uses a way to quickly read and write XML data, when the document is parsed, a series of events will be triggered for different objects, and activate the corresponding event handler (programmer writing) to complete the pair Access to the XML documentation. This shows that SAX parsers do not create any object, but let you decide what type of data structure uses data structure to save data from these events, don't need it, and you can find your own data, you can Throw an exception, stop the SAX parser, and thus do not need to access the full document, the SAX parser is parsed while reading, with a certain real-time, particularly suitable for processing of XML stream data. Of course, we also see the shortcomings of SAX, SAX events are stateless, and events only give you the text, but it will not tell you what element contains this text, you must write status management code; additional SAX event is not lasting , This point is different from the DOM, DOM parsing once, all data can be kept, SAX parsing XML document is in order, the event is constantly occurring during the parsing process, the user needs to write code to the data to be transferred. If the data is not stored, then the data is not dealt with, it can only be re-enrigerated.

For the respective advantages and disadvantages of DOM and SAX, for large XML documents, we tend to use SAX parsing to obtain the desired part of data, and then the DOM parsing.

6. About native XML database

People have increasingly tend to think that the XML database is the native XML database.

1. What is the native XML database?

One member from XML: DB Mailing List defines a native XML database: It defines a (logical) model for an XML document (instead of data in the document) and access files based on the model. This model should at least include elements, properties, PCDATA, and file sequences. An example of this model has an XPath data model, XML Infoset, and a model used by the DOM and an event of SAX 1.0. It uses XML files as its basic (logic) storage unit, as the relational database is based on the list in the table as a basic (logical) storage unit. It does not have special requirements for the underlying physical storage model model. For example, it can be built on a relational, hierarchy, or object-oriented database, or using a dedicated storage format, such as an index or compressed file.

2. Structure of native XML database

The structure of the native XML database can be divided into two categories: text-based and model-based.

The text-based native XML database stores XML as a text. It can be files in the file system, the BLOB or specific file format in the database. (In fact, in fact, a relational database that adds an XML processing feature that supports a CLOB (Character Large Object) field can also be a native XML database.) Based on the model's native XML database. They are not stored in plain text, but constructed an internal model according to files and store this model to the database. The file access performance of this type of native XML database is similar to those database, which is obvious, and its access is dependent on these databases. But this database, especially the design of the native XML database established above other databases has a large change room.

3. Select several reasons for using the native XML database [25] [48] [49]

l XML is a meta-language. XML can define and describe any kind of information.

l XML supports multiple languages. XML is based on Unicode.

l Display the diversity of data. The user obtains an XML data that can be diverted according to different needs. For example, for a paragraph, you can display text or you can display it with a table, and even read it with voice.

l has strong description and search capabilities for data, especially semi-structural data, such as web, letters, etc.). XML allows nested definitions that can describe data for complex structures.

l XML is based on text, easy to read. According to the physical form of the XML database stored data, the readout speed of the data can be much faster than the read speed of the relational database.

l XML is open to get support for almost all mainstream databases, easy to implement data exchange and integration of different data sources.

l XML helps e-commerce development. Data exchange technology adopted in the past e-commerce is EDI, which is a costly technology. It is difficult to adopt EDI for most companies. XML comes obviously reduces this difficulty. my country has put into develop XML-based e-commerce standards.

4. Development of native XML database core technology [36]

l Data storage

In the early days of XML database research, there was a debate in the industry. In the end, the XML data was stored in the relational library, or another physical database of XML is also developed. This affects XML researchers to a certain extent, must consider the indexed XML data in a variety of database structures when designing an index structure. Existing mainstream XML database products provide a Collection data structure under the underlying to store an XML element node, index these elements nodes via the B tree structure. This is the same as the underlying treatment of the relational library system. There will be one or two-level indexes above the Collection to speed up the query processing speed, which is more efficient and practical.

l Query processing

The method and efficiency of query processing have always been the primary issue of XML database research and developers.

Query processing of the XML database is typically starting from a query statement expression from a parsed query language such as XQuery. The Query parser of the XML database analyzes the expression to a query mode tree (some system is called syntax tree). Several new technologies have occurred in how to match the pattern tree. For example, in Timber, a technique called "structured coupling" is proposed, which has caused a large response in the academic community. Since 2002, there has been a technical improvement article in the three major international conferences (VLDB, Sigmod / Pods, ICDE) of the database, with more affected results is the Holistic Twig Join technology, which can be used. Interconnected multi-stack structure one-time generation query results documentation.

In the query processing phase, the minimization of the research mode tree. Its main thinking is that the matching efficiency of the pattern tree depends on the scale of the pattern tree (how much element node). The general mode tree has a certain amount of redundant node, which can be removed by minimizing algorithm. According to the experimental results of the Fudan University Database Research Center XML team show that the minimum algorithm can eliminate a 30% redundant node in a randomized XML query mode tree collection. Of course, the time complexity of the mode tree minimized algorithm is relatively high, and the current focus is mainly due to the reduction in time complexity. The time complexity of the latest algorithm has been reduced to low-order polynomial time. The practicality of this technology is worth looking forward to. l. Transaction processing and version control

Transaction processing should follow the ACID nature (atomic, consistency, independence and persistence) to ensure that most of the transaction is stably operated.

The current XML database generally provides transaction processing, including submission, withdrawal, and log files. The XML database guarantees the details of each transaction performed by providing a transaction log mechanism to ensure that you can completely recover after the system has problems.

The XML database also contains version control features for the XML document. Use version control, user or application to check (Check out) XML document, use version number, date, or tag to get the previous version of the previous version, and the version history information of the XML document. Each document under version control has its own historical information, record the author of the modification of the document, and the time, etc. Users can view the entire version history information according to documentation or user or date. Version control allows users to update their original information by querying. You can comment, modify and refine information by updating the engine. Built-in version of the system tracks information, providing historical information of these changes.

There is a need to explain the multi-transaction concurrent control mechanism and lock protocol. The R & D of this technology is currently just starting. Today's commercial XML database only provides concurrent locking protocol at the logical level, but the particle size is the entire document. With the increase of a single XML document, this particle size is obviously too thick. This may wait for the research community to develop a concurrent agreement with the granularity of the document element node.

l. Algebraic system and mode standardization

Algebraic expressions and database mode design theories have been the essence of relational database theory. The algebra system has become an important tool for the relational library query optimization, and the proposal of paradigm theory has also provided a basis for RDBMS design optimization. So, people can't help but ask if there is a similar theory and method in the XML database?

The three major experimental systems recognized in the academic community (TIMBER, the University of Michigan, TUKWILA, Washington University, Seattle, Wisconsin University, Niagara, Madison, Madison), have designed the corresponding algebra system, which is the most influential impact on Timber. TAX (TREE Algebra for XML). TAX is based on the basic unit of the operation of the entire document, providing a selection, projection, and two additional operations such as the logical layer, which are selected, projected, and two additional operations, to match the pattern tree to the instance tree (Witness Tree). . Seven basic operations are proposed in the physical layer to achieve the above logic.

TIMBER rewrites and optimizes the XML query statement based on TAX, but the effect is not ideal. The relationship between the industry to TAX is that excessively imitating an algebra system for relational libraries and ignores considerations for the characteristics of the XML document itself.

The early pioneers of the XML pattern standardized theory are Fan Wenfei, Pennsylvania University. From the definition XML key (key) and function dependent, to the XML and DTD paradigms, then to the mode standardization of the constraint-based XML database, the XML database mode standardization theory is steadily pushing. Domestic Fudan University Database Research Center and other units also have a good research progress. In the next two years, it is estimated that there will be more mature XML database mode design theories into laboratory products and commercial systems.

l Multi-data source integration

The integration of multi-proportion is the requirements for the database market for the XML database system. The integration of multi-proportional sources is just one of the advantages of XML database. Since 2001, the traditional relational database system in which multi-claims is integrated, the commercial database system such as iPedo has expanded its database system into an integrated platform, which can associate relational database systems, MIS systems, OA The system, file system, etc. are integrated on the same platform, providing users with a unified interface. For example, iPedo's iPedo XML smart platform provides users with XML View to unify the underlying heterogeneous data. People also further see the power of XML technology. 5. Future technology development direction

After nearly 5 years of colleagues, XML database technology has made great progress, and several XML database products have been introduced and served in various aspects of social life. However, the cause of the XML database has just begun, and there are many problems waiting for us to solve. In the next few years, XML database technology may make progress in the following aspects:

● Integration of heterogeneous data sources. The XML database is integrated with multi-proportional sources, which is a great play of XML technology scalability. However, it is still not enough for the current integration level and the function provided on the application layer. How to transition from integration of data to the system's integration, thus implementing a system similar to grid computing (Grid Computing) concept on the teletection target, probably one of the core tasks of the XML database worker.

● Bottom index structure. The current commercial XML database system is better than one of the characteristics of the laboratory prototype system is the index structure of its underlying. However, the underlying index structure of the existing commercial XML database is generally B tree. Although the B tree index is a mature index structure, the research results show that its performance performance is not the best in the XML database. The academic community has developed a number of index structures applicable to XML data, such as XR trees, XB trees, etc., and XML database workers need XML database workers to come further.

● Concurrently lock the protocol. In an existing XML database system, the locking particle size is the entire document, and the level of transaction is also in the document level. As the application-level document is increasing, this particle size will become a bottleneck of system efficiency to a certain extent. How to achieve the elements node granularity lock through the Edge Lock mechanism? This work has now attracted a lot of researchers, and the above-mentioned lock protocol is on the logic layer, how to map it to the base layer B tree index (or XR tree index), is also one thing that must be done. .

● XML mode specification is a direction worthy of attention. Once a breakthrough will make us easily design the structure of the XML database as soon as in the relational library, eliminate redundant and inconsistent phenomena. Currently, this area has become a hot spot in academia. However, complete, the theoretical system recognized by the industry has not yet been established.

7. List of native XML database products related to mobile or embedded systems

product

Developer

Database type

Berkeley DB XML

Sleepycat Software

Key-Value

Birdstep RDM XML

Birdstep

Object-oriented

Textml Server

IXIA, Inc.

ProPrietary (Text-Based)

Agilience Xpeerion Mobile XML

Agilience

Unknown

8. References

[1] Summary to the development of Haichu Database Technology [J] Modern Intelligence 2003, (12)

[2] Wang Shan Ding Guan Ming Zhang Xiao Mobile Database and Its Application [J] Computer Application 2000, (9)

[3] Summary of mobile database technology in Li Dong Cao Zhongli [J] Computer Application Research 2000, (10) [4] Wang Wei Wang Liang embedded mobile database review and evaluation [J] Computer Engineering 2001, (12)

[5] The latest development of Wang Zongjiang Le Jiajin Mobile Database [J] Zhengzhou Textile Institute of Technology 2001, (2)

[6] Jacob Christfort Mobile and Distributed Solutions [N] Computer World News Issue 11 B2

[7] Li An Yu Lin Lijie and other embedded mobile databases: from research direction [J] China Computer 2003, (2)

[8] Feng Yucai Li Dong et al, the architecture of mobile database management system [J] Computer Research and Development 2001, (5)

[9] Lin Huaizhong, Chen Chun, etc. Dynamic realization of transaction consistency in mobile environment [J] Computer Research and Development 2002, (1)

[10] Wang Shan Ding Guanming. Mobile Database in Mobile Calculation [N]. Microcomputer World 2001, (8)

[11] Zhang Xiao Wang Shan Du Xiaoyong. The embedded database is left a fragrance [N]. China Computer 2001, (8)

[12] Zhu Ying embedded mobile database and related issues [J] Journal of Guizhou Electronics and Technology 2003, (6)

[13] Liu Degui mobile computing technology development review [J] Mini machine and application 1999, (9)

[14] Anonymous Mobile Calculation Technology and Its Application [J] Software World 2000, (11)

[15] Cai Zhongshan Mobile computing and mobile database in my country's application prospects [J] Software World 2001, (4)

[16] Xu Li臻. Database Affairs Management in Mobile Computing Environment, Jiang Mingfei [J] Journal of Southeast University 2002, (6)

[17] Xiong Yan. Database Access Technology Based on Mobile Agent in Miri Miao Huan, etc. [J] Small Micro Computer System 2002, (10)

[18] Huang Jinzheng Cai Yuxi Embedded Mobile Database Application Research [J] Journal of Donghua University 2002, (10)

[19] Zheng Wei XML Database [N] Personal Computer 2002, (9)

[20] Li Yingmei and other XML databases and related issues [J] Journal of Natural Science, Harbin Normal University, 2002, (6)

[21] Zhou Yong et al. XML database and relational database collaborative research [J] Computer Engineering and Application 2002, (13)

[22] Zhao Junyi XML and Database [J] Journal of Inner Mongolia University (Natural Science Edition) 2003, (3)

[23] Zhang Suzhi Lu Zhengding Li Chunlin. XML Database and Its Application Research [J] Computer Engineering and Application, 2002, (8)

[24] Zhu Liang Native XML Database Technology 2003, (2) http://www-900.ibm.com/developerWorks/cn/xml/x-nxd/index.shtml

[25] Ronald Bourret XML and Databases 2004, (7) http://www.rpbourret.com/xml/xmlanddatabases.htm

[26] Agilience Xpeerion Mobile XML Database version: 1.0 Company: agilience http://www.xml.com/pub/p/690[27] Berkeley DB / Berkeley DB XML http://www.sleepycat.com/download/ Whichdownload.shtml

[28] Luo Qian et al. Based on mobile database, heterogeneous data consistency technology [J] Journal of Chongqing Teachers College 2003, (4)

[29] Song Weihong. Liu Hong SYNCML Synchronous Agreement Analysis [N] Telecom Express 2003, (7)

[30] Ren Li Gang. Song Junde Analysis Data Synchronization Agreement - Syncml [J] China Data Communication 2002, (10)

[31] Birdstep Database Management Software http://www.birdstep.com/Database_technology/index.php3

[32] Andy Sjöström Manage XML Using .NET Compact Framework, May 2003http: //msdn.microsoft.com/mobility/understanding/articles/default.aspx pull = / library / en-us / dnppc2k3 / html / mgexmlnetcpctfrmwrk.asp?

[33] http://www.ixiasoft.com/default.asp?xml=/xmldocs/webpages/textml-server.xml

[34] Ronald Bourret XML Database Products: Discontinued 2004, (1) http://www.rpbourret.com/xml/prodsdiscontinued.htm

[35] Ronald Bourret XML Database Products 2004, (9) http://www.rpbourret.com/xml/xmldatabaseProds.htm

[36] Pang Bang XML Database: Latest Development and Development Direction [N] Computer World News 2004, (9) http://www2.ccw.com.cn/04/0436/b/0436b52_1.asp

[37] Research and Implementation of XML Database Storage Technology, etc. [J] Computer Engineering 2002, (7)

[38] Research on Indexing Technology of XML Database Based on DOM of JAL FOM [J] Computer Research and Development 2004, (1)

[39] Ronald Bourret Mapping Dtds To Databases 2001, http://www.rpbourret.com/ XML / DTDTODATABASE.HTM

[40] Tong Mima. Research on Query Language Characteristics of Jin Yuanping XML Database [J] Computer Application 2001, (09)

[41] Luo Yanmin. Database Technology and Application of XML [J] Journal of OC University (Natural Science Edition) 2003, (3)

[42] Nicholas Chase appreciated DOM http://www-900.ibm.com/developerWorks/cn/cnedu.nsf/xml-onlinecourse-bytitle/386674F65A47844C48256BD10023D453?OpenDocument[43] Nicholas Chase appreciated SAX http: // www-900 .ibm.com / developerWorks / CN / CNEDU.NSF / XML-onlineCourse-Bytitle / CA45E09F1E2EF41E48256B1B000C6C7B? OPENDOCUMENT

[44] http://www.ipedo.com/html/ipedo_XML_DATABASE.HTML

[45] http://www.perfectXml.com/soft.asp?cat=15

[46] Kevin Williams Native-XML Database: A bad idea about data 2001, (10) http://www-900.ibm.com/developerworks/cn/xml/x-data/part5/index.shtml

[47] Kevin Williams Used XML: Four Skills of Flexible Architecture 2001, (8) http://www-900.ibm.com/developerWorks/cn/xml/x-data/part4/index.shtml

[48] http://www.ipedo.com.cn/product/faq.jsp?menu=2

[49] Seven Good Reasons to Choose XML http://www2.softwareag.com/corporate/products/tamino/prod_info/7goodreasons.asp

转载请注明原文地址:https://www.9cbs.com/read-107908.html

9cbs

New Post(0)