http://www.vbdotnetHeaven.com/code/jul2003/2120.asp
Abstract
Based on a real world project this article shows how to handle XML data in .NET using VB.NET DataSets and DataViews. The article presents step-by-step how to bind, read and view XML data and generate graphical reports with the build- In Crystal Reports Engine. The article Putardization and Interoperability Issues Experienced During The Project, And Presents Workarounds for Existing Limitations.
Introduction
This project was part of a larger project in the context of training acquisition and analysis A Palm PDA is used for mobile data acquisition;.. These data are transferred onto a PC into a well-formed and valid XML data file The task of this project Was to Develop a Sophistated Data Analysis and reporting Application on the PC Using The Given XML Data File.
The .NET framework was chosen for the PC application because of its great data handling capabilities, its powerful graphic functionality and the built-in reporting engine Crystal Reports. We selected the programming language VB.NET because we believe, that this is the language of The Future Under .Net and Because The IS No Major Difference Between The Supported Languages In vs.net with respect to their office functionality.
The Idea Was That Both, The Existing Palm Application and The New VB.NET Application Can Share One Common XML File As ITA Source.
XML Access Methods In .NET
.NET offers numerous XML-related classes for reading and writing XML documents. Although the documentation states, that any class conforms to the W3C XML 1.0 standard, there are trade-offs in respect to performance, efficiency, productivity, and XML compliance between the different implementations.So we first investigated into these different access methods and then selected the most appropriate for our application. This step also addresses important issues that must be considered when dealing with legacy valid and well-formed XML data in .NET.The VB. Net Developer Can Use Any of The Following Five Methods To Access XML Data:
XMLTextReader XmlValidatingReader XMLDocument (and the rest of the dom api), Xpathnavigator. ADO.NET Datasets
The detailed description of these methods would go beyond the scope of this article. Thus the following table summarizes very briefly the characteristics of each method and shows when to use which technology.
XMLTextReader:
You ued Only Read Access
Performance is your highest priority
You don't need XSD / DTD Validation
You don't need xsd type information at runtime
You don't need XPath / XSLT Services
XMLValidatingReader
You ued Only Read Access
You NEED XSD / DTD Validation
You NEED XSD TYPE INFORMATION AT RUNTIME
XMLDocument
You need to update the document (read / write)
Productivity is your highest priority
You need XPath Services
Xpathnavigator
You ued Only Read Access
You NEED TO EXECUTE AN Xslt Transformation
You want to leverage an importation (like xpathdocument) ADO.NET DATASETS
You need to update the document (read / write)
You NEED GRETEST FLEXIBILITY AND FUNCTIONALITY
You Want Greatest Tool Support
You NEED INTEROP-Support
Table 1. XML Data Access Methods in .NET
When dealing with complex XML Schemas, then ADO.NET DataSet offers greatest support and flexibility. The DataSet, which can be used either relational databases or XML data files, is an in-memory cache of retrieved data. It is the major component for data Handling in the ado.net architecture.
The DataSet provides a consistent relational programming model regardless of the data source; thus it consists of a collection of DataTable objects that you can relate to each other with DataRelation objects A DataSet reads and writes data and schema as XML documents The data and schema.. Can Be Transported Across Http and use by any application, on any platform That IS XML-enabled.
Considering All these features and the fect That This VB.NET Application Will Need To Be Webservice Interoperable with a Server Application In Future We Decided to Go for DataSet 抯.
XML Conformance In .NET
Going with datasets, readxml and readxmlschema are the methods of choice to read the xml data, and the corresponding XML schema.
However doing so with the given valid and well-factoryd XML Data file, raised the folloadinger error:
揟 HE Same Table {Description} Cannot Be The Child Table in Two Nested Relations
Looking deeper into the file shows that the valid and well-formed (and thus fully W3C compliant) XML file had a child table called description which had more than one parent table. This part of the XML Schema is shown in Figure 1. Figure 1. Original XML Schema
This is a valid and allowed design according to the W3C XML schema specification. (Refer: W3C definition of Parent child Relationship) Also the XMLSpy tool validated this XML schema as valid and well-formed..
More investigation showed that the .NET XML framework does not (yet?) Support the full XML 1.0 standard. It does not support recursive schemas or designs where a child node type is beneath more than one parent node type. Of course, also all classes Based on DataSets, Like The System.windows.Forms.DataGrid, Which Wee Were Going to Use, Have The Same Restriction. in Other Words:
XML Design Issue 1 :.NET Datasets Require That A Table Can Only Be a child of one Other Table.
Microsoft Has Identified THIS A BUG (Q325695) AND THEIR KNOWEDGE BASE ARTICLE [MS-Q325695] Gives More Information.There Are Essentially Three Ways To Solve this qu.
Write Custom Functions Write a Converter Change Your XML Schema
IF you can 抰 Break change The existing XML Schema (for Example Because you are not the output)
Customizing all your code means you write special methods for the XmlReader class and add records to a table with a hierarchical type design. You also have to implement a custom filtering mechanism if you want to display this data in bound controls, and a custom writing mechanism To write to the Type of Hierarchical Format You Are Reading from. Even Worsse, Any Cause In All of YourCustom Functions So this isn 抰 really an.
The second option, if you can Yang change the given XML Schema, is to write a converter. That means you define a new .NET compliant XML Schema for the given XML file and make use of the given DataSet capabilities. Then you write a bi -directional converter to conve convert the data from one xml scherature. Althought, IT Causes The Least Additional Effort.
If you are the owner of the XML Schema file, then solution is to change the given the given XML Schema and make it also .NET compliant. In our situation, this meant, to define three different description type structures (Idescription, Ddecscription, Fdescription ), for the three, occurrences of the description.
To avoid multiple definitions of the same structure (which can cause maintenance errors in the future), we made use of the inheritance capabilities of XML Schema. We derived the three description types from the still existing base structure back escription. The details are shown In Figure 2.
Figure 2. Avoid One Child with Several Parents
Reading XML Data with DataSets
VB.NET developers have several approaches of using a DataSet in the applications. This section explains the chosen approach to create a DataSet from the modified XML Schema, the tools that were used and how we populated the DataSet from the XML source file.A DataSet can be typed or untyped. A typed DataSet is a dataset, which is created based a given XML Schema definition file (XSD file). Information from the schema (tables, columns, etc.) is generated and compiled into the new DataSet class as A Set of First-Class Objects and Properties.
Because A Typed DataSet Class, The Typed Class Assumes All of the FunctionAn The DataSet Class and Can Be Used with Methods Take An Instance of a Dataset Class As a parameter
An untyped DataSet, in contrast, has no corresponding built-in schema. As in a typed dataset, an untyped DataSet contains tables, columns but those are exposed only as collections. (However, after manually creating the tables and other data elements in An untyped dataset, you can export the dataset 's structure as a schema using the dataset' s writexmlschema method.)
You can use either type of DataSet in your application. However, Visual Studio has more tool support for typed DataSets, and they make programming with the DataSet much easier and less error-prone. So having considered all these options the decision was to go with ADO.NET TYPED DATASETS.
Typed Datasets can be generated with the XSD.EXE tool, which is part of the VS.NET environment. The tool accepts a valid XML Schema file as input, as well as the language to use (C #, VB). The following line shows .
Xsd.exe / d /l:vb.net xsdschemaFilename.xsd /n: xiSdschema.namespace
The / d directive tells the tool to generate DataSets, / l specifies the language to use, the optional / n defines the namespace to generate. The generated DataSet classes will be saved in the source file XSDSchemaFileName.vb.
Once The Typed DataSet Classes Are Generated, The Further Procedure IS Almost A Child 抯 Play. The Provided Methods and Properties Guarantee Data Access in a Type Manner.
So the next step was to populate the Typed DataSet at runtime from the XML file The ReadXml () and the WriteXml () methods of the typed DataSet class do this very easily without any difficulty as the following to lines of code show.:
Dim myds as new dataset () myds.readXML ("Input.xml", XMLReadMode.Readschema)
Viewing XML Data
Having created and populated the DataSet the next step was to bind the data to the windows controls for user interaction. Since we were going to implement a grid view, this involved binding DataTables to DataGrids with user navigation facilities and providing parent-child relationships in the Grid View So When A User Selects a Parent Row, The Corresponding Child Rows Are To Be Shown Automatically for Editing Purposes.
The DataTable is the primary building block of ADO.NET. The DataTable is made up of a DataColumn and DataRow collections. The columns define the schema of the DataTable and the rows make up the actual data in the DataTable. A DataView is a bindable, customized view of a DataTable. you can create multiple DataView's of the same DataTable, each one can contain different data sorted in different order. Additionally, you can add, delete, or edit information in each DataView.
DataTable and DataView use the same instances of DataColumns, ie they share the common structure. However DataTable and DataView each have its own row collections. The DataTable is consists of DataRow's while the DataView is made up DataRowView's. Figure 3 shows the relationship between DataSet, DataTable and DataView.figure 3. Tables and views
The data binding of typed DataSets is very smooth in VB.NET. You can bind the data sources to the WindowsForms controls at design time or runtime without any difficulty. For example, you can specify which data column to be bound to a ComboBox in a Windowsform by Setting The Datasource and DataMember Property of That Control So That It gets Automatically Bound At Runtime.
We used the .NET DataView class to bind DataTables to the DataGrid. DataView supports data filtering and data sorting at design time or at runtime. For convenience a default DataView is generated for the developer for customization.
In The Analysis Part, The Application Needs to Display Some Statistical Information, Which IS Calculated from Several Column and Which IS Shown in Some Dynamical Generated Extra Column SHOWN Figure 4.
Figure 4. The Data View
Some of the data columns needed for these calculations were in different parent and child tables. Unfortunately it was not possible join two tables in the dataset to show up in a single DataGrid. (For example, by using the DataSet.Relations property) .To further clarify the situation say I have two Parent & child tables Table1 (id, columnA, columnB) and Table2 (id, columnC, columnD) and I want resulting datagrid columns to be view like (columnA, columnB, columnC, columnD). It Is Similar To The Situation Id I Had Wrote a SQL Query Like
Select A.ID, A.Columna, A.Columnb, B. [Columnc], B. [Columnd]
From Table1 A, Table2 B
WHERE A.ID = B.ID
IT IS IMPORTANT To Understand That DataView Is Not Equivalent of A SQL View. A Dataset Does Not Contain A Query Processor and Has No Mechanism To Perform a Join.
The Other Way To Do IT (Most Developers Seem to Suggest) Is To Simply Create A New DataTable
That is typed the way you want it and manually moved the data from both datatables to the new one. Also any data manipulation has to copy over to the original tables using a nested loops. This is a brute force method. It would be nice to Here if anyone com up with something else.
Finally more references were added to the schema file. But that is not the best workaround. The reason that the columns could not be merged is founded in the relational programming model of the DataSets. The existing XML file uses nested child tables instead of references. This is not (yet?) Resolved by the .NET DataSets, So That References Are missing to cross-reference tables and to establish rellationships.
XML Design Issue 2: 揢 SE REFERENCES IN XML Data Files INSTEAD OF NESTED TABLES TO SHOW UPUED TABLE IN A SINGLE DATAGRID
So The next change. To introduce references where needed.
Figure 5. adding references to Tables
In our application we use the Date column as a filter criteria for the generated graphs in the Crystal Report Engine. Thus in Figure 5 a Date column has been added to the session table, which is a direct reference of the date column of the day table .
Reporting XML Data
Having completed the data binding to Windows forms the next task was to generate the summarized reports in both graphical and tabular forms. We have chosen the Crystal Reports Engine (which is delivered with the VS.NET environment) with its powerful dynamic reporting capabilities.Crystal Reports for Visual Studio .Net Offers The Following Three Object Models:
The Engine Object Model
The Windows Forms Viewer Object Model
The Web Forms Viewer Object Model
The top level Engine object model is the report document object. The report document object offers sophisticated control over your object. It contains all the properties and methods needed to interface with and customize a report. How ever it does not provide the display of the reports .
THIS WHERE VIEWER Objects and The Report Document Objects Come Together. The Windows Forms Viewer Allows You to View A Report in A Windows Application and The Web Forms Viewer Does It for a Web Application.
So we selected the Engine Object model together with the Windows Object Model to accomplish the requirements. The Vb.NET CrystalDecisions.Windows.Forms namespace in the CrystalDecisions.Windows.Forms.dll assembly provides support for the Windows Forms Viewer control and its associated classes . It can also Dynamically Update The Report Being Hosted and Interact with Controls Wtems Withnin A Windows Application.
When using the Crystal Report Engine you basically have two options for the selection of data source. The first one is passing the raw XML file, and the second is passing the populated DataSet to the Crystal Report Engine.
When using the first approach you have to verify on every usage, that the XML file contains correct data types (data validation). Otherwise Crystal Report Engine would interpret the data types in the XML files differently (eg float are used as strings), which prevents the data from being processed further within Crystal Reports.The second, recommended option is passing the already populated typed DataSets to the Crystal Report Engine. This saves the runtime verification of data types. The content and the scope of the report is determined at runtime According to the user selection of the required report format. Crystal Reports Provides Template Files for this. Figure 6 shows a sample report.
Figure 6. Crystal Report View
Summary
Visual Studio .NET has a very rich set of tightly integrated tools that work together to support C # application developer. For XML based applications it provides schema generation and validation tools. It offers the methods of choice for XML handling. Especially the DataSet classes make reading And Writing of XML Data Extreme Easy. The Dataset Classes Provide Easy Data Binding to the Gui and To Even Third-Party Tools Like Crystal Reports for Advanced Reporting.
Other very nice new features are the new source code documentation support and application installer production. The code documentation tool generates a very nice HTML documentation (they improved the JavaDoc style), which is completely XML based. The integrated application installer allows easy creation of a Setup program for your application.
Despite The Currently Existing Limitations of DataSets Concerning ITS XML Standard Support, It is The Tool of Choice for XML Data Handling with .net.
Once your XML data sources confirm to the .NET XML standard the rest can be handled without much difficulty. It offers lot of built-in powerful classes and methods for reading, writing and displaying data. This allows the developer to concentrate on the More Important Application Logic Rather Than Having to Worry About Data Handling Issues Or Framework Specifics.references
[MSVSNet] Visual Studio .NET Help, Microsoft Visual Studio IDE V7.0
[W3CXML] EXTENSIBLE MARKUP LANGUAGE (XML) 1.0, 2nd Ed., Http://www.w3.org/tr/2000/rec-xml-20001006, Oct 2000
[W3cdom] Document Object Model Technical Reports, http://www.w3.org/dom/domtr
[W3cchi] w3c definition of pent child rellationship, http://www.w3.org/tr/2000/sec-well-formed
[MS-Q325695] 揈 Rror Message When You View Multi-Dimensional XSD Schemas in The Visual Studio .NET XML Schema Designer http://support.microsoft.com/default.aspx?scid=kb;n-us;q325695

