.NET big file upload knowledge finishing

xiaoxiao2021-03-06  73

Recently doing the Epartner project, involving issues uploaded by documents. I have been uploaded before, but I'm all small documents, no more than

2M

. This time I ask for uploading

100M

The above things. There is no way to find the information. WEB-based file uploading can use FTP and HTTP protocols, although the transfer is stable, but security is a serious problem, and the FTP server reads the user library to obtain permissions, which is not very convenient for the user. There is only HTTP left. There are 3 ways, PUT, WebDAV, RFC1867, and the first two methods are not suitable for large files. We currently Uploads we use the RFC1867-based HTML to upload files.

First, first briefly introduce the RFC1867 (Form-Based File Upload In HTML) standard:

1. HTML form with file submission function

The existing HTML specification defines eight possible values ​​for the INPUT element. They are: Checkbox, Hidden, Image, Password, Radio, Reset, Submit, Text. In addition, the form is default when the form uses the POST mode. ENCTYPE attribute with "Application / X-WWW-FORM-URLENCODED".

The RFC1867 standard made two modifications to HTML:

1) Add a FILE option to the Type property of the INPUT element.

2) The Input tag can have an Accept property that specifies a list of file types or file formats that can be uploaded.

In addition, this standard also defines a new MIME type: Multipart / Form-Data, and a tag that is processed with ENCTYPE = "Multipart / Form-Data" and / or contains The behavior should be taken when the form is.

For example, when the HTML form author wants the user to upload one or more files, he can write:

File to process:

The change you need to do in the HTML DTD is an option for the InputType entity. In addition, we also recommend using a series of comma-separated file types as an INPUT tag an Accept property.

... (other elements) ...

Radio | Submit | RESET |

Image | hidden | file)>

TYPE% InputType Text

Name CData #implied - Required for All But Submit and Reset

Value cdata #implied

SRC% URI # IMPLIED - for Image Inputs -

Checked (Checked) # Implied

Size cdata #implied --Like Numbers, But Delimited with Comma, Not Space

Maxlength Number #implied

Align (Top | Middle | Bottom) #implie

Accept cdata #implied --List of content type

>

... (other elements) ...

2. Document transmission delay

In some cases, the server is a recommended approach to certain elements (such as user name, account, etc.) in form data before being prepared to accept data. However, after a certain consideration, we think that if the server wants to do this, it is best to use a series of forms and transfer the data elements you have verified as the "hidden" field to the client, or by schedule The form makes those elements that need to be verified first. In this case, those servers that need to be complex applications can maintain their own state, and those simple applications can be made simple.

The HTTP protocol may need to know the total length of content in the entire transaction. Even if there is no clear requirement, the HTTP client should also provide the total length of all files, such a busy server can determine if the contents of the file are too large to process, thereby returning an error code and shutting down This connection does not have to wait until all data is accepted. At present, some existing CGI applications need to know the total length of content for all POST transactions.

If the INPUT tag contains a MaxLength property, the client can regard this property value as the maximum number of bytes that the server can accept. In this case, the server can prompt the client on the server before the start of the server can be used to upload. But it should be noted that this is just a prompt, and the actual needs of the server may change before the form is created and uploaded.

In any case, if the received file is too large, any HTTP server may interrupt the transfer during file transfer.

3. Other solutions to transfer binary data

Some people have suggested using a new MIME type "aggregate", such as aggregate / mixed or content-transfer-encoding "package" to describe the binary data that is not determined, not by decomposition into multiple parts. . Although we don't object to do, this requires additional design and standardization work to let everyone accept and understand "aggregate". From another aspect, the mechanism of "decomposition is multi-part" work is well, which is very simple to implement in the customer sender and server acceptance, and can be as highly efficient as other integrated binary data. jobs.

4. Example

Suppose the server segment is provided with the following HTML:

ENCTYPE = "Multipart / Form-Data"

Method = POST>

What is your name?

What Files Are you sending?

Users fill in "Joe Blow" in the Name field, to the problem 'What files are you sending?', Users choose

A text file "file1.txt".

Customer segment may send back the following data: Content-Type: Multipart / Form-Data, Boundary = AAB03X

--Aab03x

Content-disposition: form-data; name = "field1"

Joe Blow

--Aab03x

Content-disposition: form-data; Name = "pics"; filename = "file1.txt"

Content-Type: Text / Plain

... File1.txt content ...

--Aab03x -

If the user also selected another picture file "file2.gif", the data that the client may send will be:

Content-Type: Multipart / Form-Data, Boundary = AAB03X

--Aab03x

Content-disposition: form-data; name = "field1"

Joe Blow

--Aab03x

Content-disposition: form-data; name = "PICS"

Content-type: Multipart / Mixed, Boundary = BBC04Y

--Bbc04y

Content-disposition: attachment; filename = "file1.txt"

Content-Type: Text / Plain

... File1.txt content ...

--Bbc04y

Content-disposition: attachment; filename = "file2.gif"

Content-Type: Image / GIF

Content-Transfer-Encoding: binary

... File2.gif's content ...

--Bbc04y--

--Aab03x -

Second, using the RFC1867 standard processing file upload:

1. The uploaded data is obtained in one time, then analyze the processing.

After seeing N multi-code, there is discovered that there is currently no component program and some COM components are used using the Request.BinaryRead method. The uploaded data is obtained in one time, then analyze the processing. That's why the big file is very slow, IIS timeout does not say, even if there are hundredsm files, the analysis process has to be a while.

2. While receiving the file, write a hard drive.

Learn about foreign business components, more popular with Power-Web, Aspupload, ActiveFile, Abcupload, Aspsmartupload, Sa-fileup. Among them, the relatively excellent is aspupload and sa-file, they know that they can handle

2G

The file (SA-File E version does not even limit the size of the file), and the efficiency is also very good. Is there so much efficiency in the programming language? I checked some information, I feel that they are directly operating the file stream. This is not restricted by the size of the file. But the foreigners are not absolutely perfect, and after the AspupLoad handles large files, the memory occupation is amazing.

1G

It is generous to normal. As for SA-File, although it is good, it is hard to find. Then found 2 .NET upload components, lion.web.uploadModule and ASPNETUPLOAD are also operating file streams. But upload speed and CPU usage are not as good as foreigated business components.

Made a test, Lan internal transmission

1G

document. Aspupload upload speed average

4.4M

/ S, CPU takes up 10-15, memory occupation

700M

. SA-File is also almost like this. And ASPNETupload is the fastest

1.5M

/ s, average is 700k / s, CPU takes up 15-39, test environment: PIII

800,256m

RAM,

100M

LAN. I think the ASPNETUPLOAD is slow to write the hard drive because of the reception file. The cost of resource occupation is to reduce the transmission speed. But I have to admire foreigners, the CPU is so low .....

Third, the problem encountered by the ASP.NET upload file

We have encountered such or such problems when using major files on ASP.NET. Setting big maxRequestLength values ​​do not completely solve the problem, because the ASP.NET will block until the entire file is loaded into memory. In fact, if the files are large, we often see Internet Explorer display "The page cannot be displayed - cannot Find Server or DNS ERROR", it seems to be Catch can't. why? Because this is a Client Side error, the Application_ERROR of the Server Side end is derived.

Fourth, ASP.NET big file upload solution

The solution is to use an implicit HttpWorkerRequest to read the data from the PIPE established by IIS as ASP.NET with its GetPreLoadedEntityBodyRequest. Chris Hynes provides us with such a solution (with httpmodule), which can display upload progress in real time in addition to allowing you to upload large files.

Lion.Web.uploadModule and ASPNETUPLOAD two .NET components are using this solution.

Principle:

Using the HTTPHandler implements the functionality of the ISAPI Extens, the request (REQEST), and the response.

Program points:

1. Httphandler or httpmodulea. Intercept request object B. Brake read and write data c. Real-time tracking upload schedule update meta information

2. HttpWorkerRequest use implied by its GetPreloadedEntityBody ReadEntityBody and method for processing stream file IServiceProvider provider = (IServiceProvider) HttpContext.Current; HttpWorkerRequest wr = (HttpWorkerRequest) provider.GetService (typeof (HttpWorkerRequest)); byte [] bs = wr.GetPreloadedEntityBody ( ); .... if (! Wr.isentirentityBodyispreLoaded ()) {INT N = 1024; Byte [] bs2 = new byte [n]; while (wr.readentityBody (BS2, n)> 0) {.... }

3. Custom Multipart MIME parser automatically intercepting MIME splitting files to write files such as temporary files

Real-time update Appliaction status (ReceivingData,

Error,

Complete

)

转载请注明原文地址:https://www.9cbs.com/read-106521.html

New Post(0)