Conversion Guide: Transfer the program from managed C ++ to C ++CLI

xiaoxiao2021-03-06 150

Conversion Guide: Migrate the program from managed extended C to C / CLI

STANLEY B. Lippman Microsoft

Translation: Jiang Wei

August 2004

Suitable for: C / CLI second edition ISO-C

Abstract: C / CLI represents a dynamic model extension of an ISO-C language standard. This article lists the features of the V1 version language, and their correspondence in V2 versions (if present); and point out the language characteristics of the corresponding V1 feature. (68 Print Page)

Translator Note:

The original address is at http://msdn.microsoft.com/visualc/default.aspx?pull=/library/en-us/dnvs05/html/transguide.asp (English). This article has been published on the MSDN Chinese website, the URL is http://www.microsoft.com/china/msdn/library/langtool/vcpp/transguide.mspx. The MSDN version has a modification of the articles in the C / CLI language specification MSDN on the Microsoft website: http://msdn.microsoft.com/visualc/rs.xml

table of Contents

Introduction Language Keyword Tube Data Type Class or Member Declare Value Type and Its Behavior Language Change Overview

Appendix drives a revised language design

thank

Introduction

C / CLI represents a dynamic programming paradigm extension in the ISO-C standard language. There are many significant weaknesses in the original language design (V1), we feel that in the revised language design (V2) has been corrected. . This article lists the features of the V1 version language and their correspondence in V2 versions (if such a corresponding presence, it is pointed out as the language characteristics of the V1 feature that does not exist. For interested readers, the appendix is provided in the appendix. In addition, a source code level conversion tool (MSCFRONT) is being developed, and may provide people who want automated porting V1 code to new language design in the release of C / CLI. This article is divided into five chapters plus an appendix.

The first section discusses the main language keywords, especially the removal of the double downline and

Contextuality and

Segment keyword.

The second festival is on the change of the type of hosted data - especially hosting

Quote types and array types. It is also possible to find a detailed discussion of Deterministic Finalization. About changes in class members, such as attributes, index properties, and operators,

The focus of the third quarter.

Section II focused on the grammatical changes of hosting enumeration, internal and constraints. It also discusses many considerable semantic changes, such as implicit packing, hosted

CLI

Changes of enumeration, and

The support of the value class default constructor is removed.

The fifth section has a bit like hodgepodi - a famous wolf

Miscellaneous. You can find behavior and parameters for type conversion symbols, string constants in it.

Discussion of arrays.

Language keyword

A general conversion of the original version to the revision is to remove the double underscore in all keywords. For example, an attribute is now declared as Property instead of __property. Two main reasons for using double-downline prefix in the original design is:

This is a method of providing a language extension that meets ISO-C standards. One of the main purposes of the original language design is the incompatibility of untrunnation and standard languages, such as new keywords and tags. This reason greatly promotes the choice of pointer syntax for the statement of the recorded type of custodial reference. Double underscore use, in addition to compatibility, it is also a reasonable guarantee that does not affect the user's basic code. This is the second main purpose of the original design.

In this case, why do we remove dual underline (and introduce some new tags)? No, this does not mean that we will no longer consider and maintain consistent! We continue to be consistent with standards. Despite this, we realized that the support of the CLI dynamic object model exhibited a new powerful programming model. Our experience in the original design and the design and development of C make us convinced that the support of this new model requires its own advanced keywords and tags. We want to provide a new model of first-class expression, integrating it and support standard languages. We hope that you will feel the revised language design provides a first-class programming experience for these two distinctive object models.

Similarly, we care about minimizing the impact of these new keywords to existing code. This is solved with contextual and segment keywords. Before we look at the revision of the actual language grammar, let's try to figure out the flavor of these two special keywords.

A contextual keyword has a special meaning in a particular locale. For example, in a usual program, Sealed is identified as a normal identifier. However, in a statement of a managed category type, it is identified as a keyword in the category context. This minimizes the potential impact of introducing a new keyword in the language, and we feel that this is very important for users with our old code base. At the same time, it allows new features to get a first-class new language characteristic experience - we are missing these factors in the original design. We will see Sealed usage in Sea in Sea.

A segment key is a special case for contextual keywords. Field is a contextual modifier and an existing keyword pair, separated by space. This pair is identified as a syntax unit, such as a Value Class (see 2.1), not two separate keywords. Based on reality, this means a macro of redefining the value of value, as shown below:

#ifndef __cplusplus_cli

#define value

Will you remove Value in a class declaration. If you do so, you have to redefine the syntax pair as described below:

#ifndef __cplusplus_cli

#define value class class

This is necessary to take into account the factors of reality. Otherwise, existing #define may convert the contextual keyword section of the segment keyword. (Translator Note: For example, in January 2003, the #define interface structure in the platform SDK header file, see http://blog.joycode.com/jiangsheng/archive/2004/12/17/41283.aspx).

2. Managed data type

Declaring managed data types and creation, and using these types of objects have been greatly modified to increase compatibility to ISO-C type systems. These changes are detailed in the subsequent section. The discussion of the entrustment delayed from Section 3.3 to express them in an event member of the class - this is the topic of Section 3. (About more detailed tracking quotation grammar introduces the discussion of the main transformation in insider and design, see the appendix promotion revision language design.)

2.1 Declare a managed class type

In the original language definition, a reference class is starting with the __gc key. In the revised language, __ gc keyword is replaced by one of two segment key Ref classes or REF STRUCT. Struct or Class selection only indicates the default disclosure of the portion that does not explicitly accessed at the beginning of the type body (for struct) or private (for Class) default access level.

Similarly, in the original language design, a reference class is starting with the __value keyword. In the revised language, __ value keyword is replaced by one of two segment key Value Class or Value Struct.

In the original language design, an interface type is indicated by keyword __interface. In a revised language, it is replaced by Interface Class. For example, the following statement set

// original grammar

Public __gc class block {...}; // reference class

PUBLIC __VALUE CLASS Vector {...}; // value class

Public __interface iMyfile {...}; // Interface class

The equivalent declaration under the revised language design is as follows:

// Rev. syntax

Public Ref class block {...};

Public value class vector {...};

Public interface class iMyfile {...};

The idea that REF (for reference types) instead of GC (for garbage collection) is to better imply the essence of this type.

2.1.1 Specify a class as an abstract type

In the original language definition, the keyword __abstract can be placed before the type keyword (before __gc) to indicate that the class has not been completed, and such objects cannot be created in the program:

PUBLIC __GC __ABSTRACT CLASS Shape {};

PUBLIC __GC __ABSTRACT CLASS Shape2D: Public Shape {};

In the revised language design, the Abstract contextual keyword is defined after the class name, the class, the base class is derived before or the semicolon.

Public Ref class shape abstract {};

Public Ref class shape2d abstract: public shape {};

Of course, the semantics have not changed.

2.1.2 Specify a class as a closed type

In the original language definition, the keyword __sealed is placed before the class keyword (before __gc) to indicate that the class cannot be inherited:

PUBLIC __GC __SEALED CLASS STRING {};

In the V2 language design, the Sealed contextual keyword is limited to the class name, the class, the base class derived list or semicolon (you can close it while declaring a inherited class. For example, String class implicitly Self Object). The advantage of enclosing a class is to allow static (that is, when compiling, the object calls for this seal reference type object are parsed. This is because the sealing indicator guarantees the String tracking handle that cannot point to a derived class object that may overload the triggered false method.

Public Ref class string sealed {};

You can also declare a class that is also declared as a closed class. This is a special case called a static class. This is described below in the CLI documentation

At the same time, you can only have a static member for abstract and closed types, and the same manner as the namespace is called in some languages.

For example, this is a statement of abstract closed class using V1 syntax

Public __gc __sealed __ABSTract CLASS STATE

{

PUBLIC:

STATIC state ();

Static Bool INPARAMLIST ();

Private:

Static bool ms_inparam;

}

This is this statement in the revised language design:

Public Ref Class State Abstract Sealed

{

PUBLIC:

STATIC state ();

Static Bool INPARAMLIST ();

Private:

Static bool ms_inparam;

}

2.1.3 CLI inheritance: Specify the base class

In the CLI object model, only single inheritance of public mode is supported. However, in the original language definition, ISO-C explains the base class is still retained. The base class without access to the keyword will be a private derived type by default. This means that every CLI inheritance declares have to replace the default explanation with a public keyword. Many users think that the compiler seems to be too rigorous. // v1: Error: The default is privately derived

__gc class my: file {};

In the revised language definition, the CLI inheritance defines the lack of access to the keyword, default is derived in public way. In this way, the public access key is no longer necessary, but optional. Although this change does not need to make any modifications to V1, I will still list this change for integrity.

// v2: Correct: By default is the publicity derived

Ref class my: file {};

2.2 References of a CLI reference object

In the original language definition, a reference class type object is based on the ISO-C Pointer syntax, and an optional __gc keyword is used on the left side of the asterisk. For example, this is a statement of multiple reference type types under V1 syntax:

PUBLIC __GC CLASS FORM1: PUBLIC System :: Windows :: Forms :: form {

Private:

System :: ComponentModel :: Container__gc * Components;

Button __gc * button1;

DataGrid __gc * mydatagrid;

Dataset __gc * mydataset;

Void PrintValues (array * myarr)

{

System :: Collections :: ienumerator * myenumerator =

Myarr-> getenumerator ();

Array * localarray = myarr-> copy ();

// ...

}

In the revised language design, the reference class type object is declared with a new declarative symbol (^), formally expressed as a tracking handle, and more unfair expression is a hat. (Tracking this adjective emphasizes that the reference type object is located in the CLI heap, so it can be transparently moving in the stack of garbage collection compression processes. A track handle is transparently updated during runtime. Two similar concepts: (a) Track reference (%) and (b) internal pointers (interior_ptr <>), discussed in Section 4.4.3.

Declaration syntax no longer reuses ISO-C pointer syntax has two main reasons:

The use of the pointer syntax does not allow the operator to be reused to reference objects; there is to call the internal name of the operator, such as RV1-> op_addition (RV2) instead of more intuitive RV2 RV2. Many pointer operations such as type forced conversion and pointer arithmetic are invalid for objects located on the garbage collection stack. We feel that the concept of a track handle is preferably in line with the nature of a CLI reference type.

Use the __gc modifier for a tracking handle and is unnecessary and not supported. The usage of the object itself has not changed, which is still accessing the member (->) accessed by the pointer member. For example, this is the result of the above V1 text translation to the revised language syntax:

Public Ref Class Form1: Public System :: Windows :: Forms :: form {

Private:

System :: ComponentModel :: Container ^ Components;

Button ^ Button1;

DataGrid ^ MyDataGrid;

Dataset ^ myDataSet;

Void PrintValues (array ^ myarr)

{

System :: Collections :: ienumerator ^ myenumerator =

Myarr-> getenumerator (); array ^ localarray = myarr-> copy ();

// ...

}

(Translator Note: ^ Reference The entire object in the hosted stack cannot be used to point to the inside of the type.)

2.2.1 Dynamic allocation objects on the CLI pile

In the original language design, the existing NEW expressions allocated on conventional stacks and hosted stacks are largely transparent. In almost all cases, the compiler can correctly determine the conventional stack or hosted stacks from the context. E.g:

Button * Button1 = new button; // Good: House

INT * pi1 = new int; // Good: Traditional Pile

INT32 * PI2 = New Int32; // Good: Household

The result of the contextual heap allocation is not expected, and the __gc or __nogc keyword guide compiler can be used. In a revised language, use the newly introduced GCNEW keyword to significantlyify the different nature of the two New Expressions. For example, the above declarations look like this in the revised language:

Button ^ Button1 = gcnew button; // Good: Household

INT * pi1 = new int; // Good: Traditional Pile

Interior_ptr pi2 = gcnew int32; // Good: hosted pile

(Discussing more detail of Interior_PTR in Section 4. Usually, it represents an address of an object, this object does not have to be on the hosted stack. If the point to which the object is indeed located, it is transparent when the object is repositioned Update)

This is the initialization of the Form1 member V1 version declared in front section:

void initializecomponent ()

{

Components = New System :: ComponentModel :: Container ();

Button1 = new system :: windows :: forms :: button ();

MyDataGrid = new datagrid ();

Button1-> Click =

New system :: EventHandler (this, & form1 :: button1_click);

// ...

}

This is the same initialization process rewritten with the revised syntax, and the reference type is the result of a GCNEW expression that does not require a "hat".

void initializecomponent ()

{

Components = GCNEW System :: ComponentModel :: Container

Button1 = gcnew system :: windows :: forms :: button;

MyDataGrid = GCNew DataGrid;

Button1-> Click =

GCNEW System :: EventHandler (this, & form1 :: button1_click);

// ...

}

2.2.2 an empty object tracking reference

In a new language design, 0 no longer represents an empty address, but is processed as an integer, like 1, 10, 100, so that we need to introduce a special mark to represent an null value tracking reference. For example, in the original language design, we will initiallyify a reference type as an empty object reference:

/ / Correct: We set OBJ not to reference any object

Object * Obj = 0;

// Error: No implicit box

Object * Obj2 = 1; In the revised language, any initialization or assignment of any slave value to an Object causes an implicit packing of a value type (Implicit Boxing). In the revised language, OBJ and OBJ2 are initialized to pack-over INT32 objects, each having values 0 and 1, respectively. E.g

// lead to implicit boxes of 0 and 1

Object ^ obj = 0;

Object ^ obj2 = 1;

Therefore, in order to allow explicit initialization, the value is assigned a tracking handle, and we introduce a new keyword, NULLPTR. The correct revision of the V1 example appears as follows:

/ Ok: We set up OBJ does not quote any object

Object ^ obj = nullptr;

/ Ok: We initialize Obj as an int32 ^

Object ^ obj2 = 1;

This makes the transplant designed from the existing V1 code to the revised language. For example, consider the following declaration:

__Value struct holdingr {// original V1 syntax

Holder (Continuation * C, SEXPR * V)

{

CONT = C;

Value = v;

Args = 0;

ENV = 0;

}

Private:

CONTINUATION * CONT;

Sexpr * Value;

Environment * ENV;

Sexpr * args __gc [];

}

Here, Args and ENV are CLI reference types. In the process function, the statement that initializes it is 0 must be modified to Nullptr during the transfer to a new grammatical process.

// Revision V2 syntax

Value Struct Holder

{

Holder (Continuation ^ C, SEXPR ^ V)

{

CONT = C;

Value = v;

Args = NULLPTR;

ENV = NULLPTR;

}

Private:

CONTINUATION ^ Cont;

SEXPR ^ Value;

Environment ^ ENV;

Array ^ args;

}

Similarly, these members and 0 comparisons must also be changed to NullPtr comparison. This is the original syntax:

// Original V1 syntax

Sexpr * loop (Sexpr * Input)

{

Value = 0;

Holder Holder = Interpret (this, Input, ENV);

While (Holder.Cont! = 0)

{

IF (Holder.env! = 0)

{

Holder = interpret (Holder.cont, Holder.Value, Holder.env);

}

Else IF (Holder.Args! = 0)

{

Holder =

Holder.Value-> Closure () ->

Apply (Holder.cont, Holder.Args);

}

Return Value;

}

And here is a revised edition. Converting each 0 to Nullptr (the translation tool is helpful for this conversion, automatically handles many - if not all - instances, including using null macros.

// Revision V2 syntax

SEXPR ^ loop (sexpr ^ input)

{

Value = NULLPTR;

Holder Holder = Interpret (this, Input, ENV);

While (Holder.Cont! = Nullptr)

{

IF (Holder.Env! = nullptr) {

Holder = interpret (Holder.cont, Holder.Value, Holder.env);

}

Else if (Holder.Args! = nullptr)

{

Holder =

Holder.Value-> Closure () ->

Apply (Holder.cont, Holder.Args);

}

Return Value;

}

Nullptr can be converted into any tracking handle type or pointer, but it cannot be upgraded to a integer type. For example, in the initialization set, NullPtr is only valid in both initial values at the beginning.

/ / Correct: We set OBJ and PSTR do not quote any object

Object ^ obj = nullptr;

Char * pstr = nullptr; / / 0 can also be used here

// Error: There is no conversion from Nullptr to 0 ...

INT IVAL = NULLPTR;

Similarly, a given method set is as follows:

Void f (Object ^); // (1)

Void f (char *); // (2)

Void f (int); // (3)

The call to use Nullptr is as follows

// Error: Ambiguity: Match (1) and (2)

f (Nullptr);

It is ambiguous because NullPtr matches a tracking handle and matches a pointer, and there is no priority selection in both (this requires an explicit type forced conversion to eliminate ambiguity).

A call using 0 is just matching examples (3):

/ / Correct: Match (3)

f (0);

Since 0 is integer. When there is no F (int), it matches F (char *) through a standard conversion. When there is no precise match, the standard conversion is given to the priority of implicit packments for value types. This is why there is no ambiguity here.

2.3 CLI array declaration

The statement of the CLI array in the original language design is a little unertally expanded by the standard array statement. A __GC keyword is placed between the array object name and the possible comma-filled dimensions, as shown in the following example:

// v1 syntax

Void PrintValues (Object * myarr __gc []);

Void PrintValues (int myarr __gc [,]);

This is simplified in the revised language design. We use a vector declaration that mimic STL similar to template. The first parameter specifies the element type. The second parameter specifies the array dimension (the default is 1, so only the multi-dimensional array requires the second parameter). Array object itself is a tracking handle, so a hat must be given. If the element type is also a reference type, they must also be marked. For example, the above example, when it is expressed in a revised language, it looks like this:

// v2 syntax

Void PrintValues (Array

Conversion Guide: Transfer the program from managed C ++ to C ++CLI

9cbs