RAID Technology Gold Tutorial

zhaozj2021-02-16  59

RAID is called Redundant Array Of Disks, which is a "Independent Disk Redundant Array" (initially "cheap disk redundant array") abbreviation. In 1987, Patterson, Gibson and Katz were defined in an article in the University of California University. RAID array technologies allow a range of disk to be packetized to implement data redundancy required for data protection, and data strip distributions formed to improve read and write performance. RAID originally used in the high-end server market, but with the rapid development of computer technology, RAID technology has penetrated into various fields of computer all over. Today, in the household computer motherboard, the RAID control chip is also visible. Generally, the RAID system can exist in various interface interfaces, as we are currently, PATA, SATA, and SCSI have corresponding hard drives to form RAID. With the release of the Intel 865/875 series chipset, the hard disk interface of the home market begins to switch SATA, while the RAID method will also transition from PATA to SATA. RAID technology has experienced a series of changes and development experience with people's use. In the household market, we can only see the RAID 0, RAID 1, and RAID 0 1 of these disk array. However, from the birth of the DFI LANPARTY motherboard, we ushered in the fourth disk array, that is, RAID 1.5. From practical applications, most of the formation of home RAID is to further improve the read and write performance of the disk, and the data backup can be reached in other ways (such as burning). Therefore, in the case of only 2 hard drives, people are willing to try RAID 0, but the birth of RAID 1.5 makes us change this concept. What is the performance of these two relatively inexpensive disk arrays? Let us notice everyone.

RAID 0: RAID 0 uses a technology called "striping" to distribute the data to each disk. Every "strip" is dispersed to a continuous "block" (block), and the data is divided into several blocks from 512 bytes to Duzab, and then write to the disk. The first block is written to the disk 1, and the second block is written to the disk 2, so that this is pushed. When the system reaches the last disk in the array, it is written to the next segment of the disk 1, so.

Segmentation data can assign an I / O load average into all drives. Since the driver can be written or read simultaneously, performance is significantly improved. However, it has no data protection capabilities. If a disk is malfunction, the data will be lost. Therefore, RAID 0 does not apply to a critical task environment, but it is very suitable for video, image production and editing.

RAID 1: RAID 1 is also known as a mirror, because the data on a disk is completely copied to another disk. If the data of a disk is incorrect, or the hard disk has a bad track, then another hard disk can rescue data loss and system interrupts caused by the disk failure. In addition, RAID 1 can also achieve duplex - you can copy the entire controller so that your data can be protected when a disk failure or controller failure occurs. The disadvantage of mirroring and duplex is to need more than one count of drives to copy data, but the system's read and write performance will not increase, which may be a small expense. RAID L can be implemented by software or hardware.

RAID 2: RAID 2 is a sea-coded check disk array developed for large machines and supercomputers. The first, second, fourth, 4th, 4th ... of the second N times of the disk drive group is a special check-in, for check and error correction. As shown below: RAID 2, 1, 2, 4 disk drives (red) of seven disk drives are the error correction disc, and the remaining (purple) is used to store data. RAID 2 has a very high performance of the reading and writing of large data, but the performance of a small amount of data is not good, so RAID 2 actually uses less. Due to the particularity of RAID 2, as long as we use the disk drive, the less the percentage of the verification disk is in it. If you want to achieve a relatively ideal speed and better disk utilization, it is best to increase the hard disk that saves the check code ECC code, but this will pay more of the purchase cost of the hard disk to ensure data redundancy. For the design of the controller, it is simpler than the RAID 3, 4 or 5 described below.

RAID 3: RAID 3, which is a band with a special parity (Parity). Each strip piece is equivalent to a "block" that is so large to store redundant information, ie phenomenon. The parity bit is encoded information. If the data of a disk is incorrect, or the disk is faulty, it can be used to recover data. In a data-intensive environment or a single user environment, the establishment of RAID 3 is advantageous to access longer, but the performance will decrease when accessing shorter records, like RAID 2.

RAID 4: RAID 4 is a stand-alone disk structure with parity code. It is very similar to RAID 3, and the difference is that RAID 4 access to data is performed by data block. RAID 3 is a horizontal strip, while RAID 4 is a vertical bar. So RAID 3 often allows all hard drives in the array, while RAID 4 is only required to access the useful hard drive. The speed of reading data is greatly improved, but in terms of write data, the old data to be restored from the data hard drive and verify the hard drive is required, and then write the updated data and inspection blocks. The hard drive, so the processing time is longer than RAID 3.

RAID 5: RAID 5 is also called a band with distributed parity. Each strip is equivalent to a "block" that is used to store the odd bits. Unlike RAID 3, RAID 5 is also distributed on all disks in all disks, which is not a disk, greatly reduces the burden of parity discs. Although there are some capacity losses, RAID 5 can provide a more perfect overall performance, which is also a disk array scheme that is widely used. It is suitable for input / output intensive, high read / write ratios, such as transaction processing, etc.

RAID 6: RAID 6 is a stand-alone disk structure with two distribution stored parity code. It uses a second parity assigned on different disks to achieve enhanced RAID 5. It can withstand multiple drivers at the same time, but it is more time to calculate the time spent on parity and verification data, which causes the system's load, which greatly reduces the overall disk performance, and the system needs one Extremely complex controller. Of course, since the second parity value is introduced, we need N 2 disks.

RAID 7: RAID 7 own with intelligent real-time operating system and software tools for storage management, can be run independently of the host, do not occupy the host CPU resources. RAID 7 Storage Computer Operating System is a real-time event-driven operating system, primarily for system initialization and arrange all data transfer of the RAID 7 disk array and convert them to the corresponding physical storage drive. To set and control the read / write speed via Storage Computer Operating System, you can make the host I / O pass performance optimally. If a disk is faulty, the recovery operation can be automatically executed, and the reconstruction process of the backup disk can be managed. RAID 7 uses non-synchronous access methods, which greatly reduces the data writing bottleneck and increases I / O speed. (The so-called unconductive access, that is, each I / O interface of RAID 7 has a dedicated high-speed channel, as a traffic path for data or control information, so it can independently control the data access of each disk in your own system.) If RAID 7 has n disks, remove a check disk (used as redundant calculations), can handle the read / write instructions randomly emitted at the same time simultaneously, thereby significantly improvement I / O applications. The RAID 7 system has a built-in real-time operating system that automatically optimizes the read / write instructions sent by the host. The data that may be read in advance will be read in advance, which greatly reduces the number of rotations of the magnetic head. Improve the I / O speed. RAID 7 helps users have effectively managed increasingly large data storage systems, and increase the operational efficiency of the system to at least double, meeting the different needs of various users.

RAID 10 (RAID 0 1): RAID 10, also known as the mirror array strip, now we generally call it RAID 0 1. RAID 10 (RAID 0 1) provides 100% data redundancy, supporting larger volume sizes. To form a RAID 10 (RAID 0 1) requires 4 disks, two of which provide the band data distribution, which provides the read and write performance of RAID 0, and the other two is the image of the front two hard drives, ensuring the complete data. Backup.

RAID 30: RAID 30 is also known as a dedicated parity array strip. It has the characteristics of RAID 0 and RAID 3, composed of arrays from two sets of RAID 3 disks (3 disks per group), using dedicated parity, and these two disks have a RAID 0 array to implement cross-disk extraction data . RAID 30 provides fault tolerance and supports larger volume sizes. Like RAID 10, RAID 30 also provides high reliability because there are two physical disk drives fail (one in each array), and data is still available.

The RAID 30 minimal requires 6 drives, which is best for non-interactive applications, such as video streams, graphics, and image processing. These applications processes large files and require high availability and high speed.

RAID 50: RAID 50 is called a distributed parity array strip. Immortal with RAID 30, it has the common characteristics of RAID 5 and RAID 0. It consists of two sets of RAID 5 disk (at least 3 each), each group uses a distributed odd-even bit, and the two sets of hard drives are set to RAID 0, and data is extracted across disk. RAID 50 provides reliable data storage and excellent overall performance and supports larger volume sizes. Even if the two physical disks have failed (one in each array), the data can also recover smoothly.

RAID 50 requires a minimum of 6 drives, which is best suited for applications that require high reliability storage, high read speed, high data transmission performance. These applications include transaction and office applications with many users to access small files. RAID 53: RAID 53 is called an efficient data transfer disk structure. The implementation of the structure with the Level 0 data strip array, where each section is a RAID 3 array. Its redundant and fault-tolerant ability with RAID 3. This is beneficial for systems that require high data transfer rates, but it is expensive and low efficiency.

RAID 1.5: RAID 1.5 is a new disk array method, which has the characteristics of RAID 0 1, and the difference is that its implementation requires only 2 hard drives. From the surface, the disk after RAID 1.5 is formed, and both have the same data. Of course, RAID 1.5 is also a disk array mode that cannot fully utilize disk space, so two 80GB hard drives are the same as RAID 1.5, that is, only 80GB actual use space, and another 80GB is it. Backup data. If the two hard drives are separated, they run them in the original system, and they are unobstructed. However, by practical applications, we found that if the two hard drives are running separately, the slight changes in their data will cause the reorganized disk array, and complete data recovery cannot be achieved, but is subject to less data.

Since RAID 1.5 and RAID 1 have a very similar effect, how is it experimenting with RAID 0 strip read and write operations? So far, we have not yet true materials prove that the following imaginary: The disk array control chip has advanced control functions, allowing two disks to record the same data in the way, but when reading, control But you can distinguish between the program strips that need to be read, and then read different strips from different hard disks, respectively, to improve performance RAID 0 effects. In fact, for home users, most of the implementation of RAID 1.5 is still improved in performance, after all, RAID 1.5 has the characteristics of RAID 0. Therefore, this time we make a comparison test for the performance of RAID 1.5. Below, let's take a look at RAID 1.5 performance: RAID level RAID 0 RAID 1 RAID 0 1 RAID 1.5 name strip mirror strip mirror strip mirror allowable, no copy copy copy hot standby Operation can be possible with a number of hard drives by more than 2 4 2 available capacity maximum minimum intermediate reduction capacity without 50% 2 disk 1 disk read performance high (the number of discs) in the middle, high random write performance Intermediate High Continuous Writing Performance Maximum Intermediate High Typical Application No fault quickly reading and writing allows faulty small files, random data writes small files that allow faults, continuous data transfer allows faults, random data transmission performance test: Hardware platform CPU Intel Pentium4 3.0GHz (Northwood, 800MHz FSB) graphics card Qaheng Gaji TI4680 VIVO driver: NVIDIA DETONATOR 4 V43.50 Memory Kingmax DDR400 DDRRRAM 256MB × 2 Memory Parameters Unified Setting in DDR 400 AUTO Mode Hard Disk Cool Fish V SATA

120g

(7200 rpm, 8MB cache) × 2 Gold Diamond 9 PATA

80g

(7200RPM, 2MB Cache) × 2 Driver: Intel IAA 3.0 with RAID Motherboard MSI (MSI) 875P NEO DFI (Diamond) LANPARTY PRO875 PS: Plate Sound Card, NIC Equipment All Turn Drivers: Intel Chipset Software V 5.00 Operating System Windows XP Professional SP1 English version DirectX DirectX 9.0 Software Platform Test Software ZDNET Winbench 99 2.0 Sisoft Sandra 2003 HDtach 2.61 This test The patr RAID 1.5 built by the motherboard onboard is compared. The hard disk type used by the SATA disk array and the PATA disk array is different, so the test score is for reference only. Winbench 99: Testing through WinBench 99, we can understand that after the establishment of RAID, whether it is commercial disk performance or high-end disk performance, it has been greatly improved. The RAID 1.5 composed of two gold diamonds is the most powerful in commercial disk performance, and the high-end disk performance is headed by the SATA disk array controlled by ICH 5 RAID 0, and the results are quite gratifying. In order to further compare the difference in PATA RAID 1.5 and SATA RAID 0, we did the following comparative tests:

Highpoint Pata Raid 1.5

Promise 378 SATA RAID 0

Through the comparison of two RAID modes, we found that RAID 1.5 system scores built using Diamondmax 9 Plus hard drives, the disk transfer rate can reach 95kb / s, and the average transmission rate is also close to 70kb / s . During data transmission, the volatility of its transmission rate is relatively regular, and the transmission performance is stable. As an excellent new generation disk interface, SATA, of course, has also shown extraordinary strength. The RAID 0 system built by two baracuda V (Cool 5 generation) has no RAID 1.5 performance in the highest transmission rate of the disk, but can also exceed 90kb / s, but the minimum transmission rate is higher than the RAID 1.5. The distance, the average transfer rate is more than RAID 1.5. Sisoft Sandra 2003: In the disk performance test of Sisoft Sandra 2003, we all use an empty disk partition as a target disk, which is

20g

. The RAID 1.5 system built by PATA is not as good as in the WinBench 99 test, and the SATA RAID 0 composed of two control chips is far away.

Highpoint Pata Raid 1.5

Promise Sata Raid 0

ICH 5 SATA RAID 0 HD TACH 2.61:

Highpoint Pata Raid 1.5

Promise Sata Raid 0

Ich5 SATA RAID 0 In HD TACH 2.61 test, RAID 1.5 re-exhibits the greenery, and the average 47.8Mb / s read speed makes an average of 31.1Mb / s SATA RAID 0 headed down. However, in terms of CPU usage, SATA RAID 0 is significantly smaller than PATA RAID 1.5, whose minimum is the RAID 0 system controlled by ICH5, only 1.2%. However, it is that the RAID 1.5 system with the highest CPU usage is only 2.8%. It is also insignificant for mainstream machine configuration today more than 2GHz. In the Sysmark 2002 test, we found that the performance of the whole machine after the establishment of RAID is not as big as the imagination, but there will be 5% improvement in office performance. The RAID 1. RAID 0 built in the PATA hard drive is not large.

However, in the PCMark 2002 test, RAID 1.5 has once again exceeds RAID 0 levels and makes people have to admire.

Summary :RAID is a long history of historical disk array technology. Its birth is not only to ensure that our important data is safe for a long time in redundant disk arrays, or break through the bottleneck and improve performance. So far, RAID technology has been applied in various fields penetrated by the computer, and the home computer market, this has been a market that is considered by RAID R & D staff, has also been a result of several years. In the past, RAIDs formed by PATA hard drives in the home market usually only RAID 0, RAID 1 and RAID 0 1, but today we not only witnessed the birth of RAID 1.5, but also felt the unique charm of RAID 1.5. Compared to the RAID 0 system built by the mainstream SATA hard disk, the RAID 1.5 built by the PATA hard drive is not a bit of weakness. In contrast, RAID 1.5 seems to be the most effective way to improve performance and ensure data security.

转载请注明原文地址:https://www.9cbs.com/read-18287.html

New Post(0)