CN103051676A - Distributed data storage management method - Google Patents

Distributed data storage management method

Info

Publication number: CN103051676A
Authority: CN (China)
Prior art keywords: data, layer, storage, management method, distributed
Legal status: Pending (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Application number: CN2012104863468A
Other languages: Chinese (zh)
Inventor: 平原
Current Assignee: Inspur Electronic Information Industry Co Ltd (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Original Assignee: Inspur Electronic Information Industry Co Ltd
Priority date: 2012-11-26 (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date: 2012-11-26
Publication date: 2013-04-17
Application filed by: Inspur Electronic Information Industry Co Ltd
Priority to: CN2012104863468A
Publication of: CN103051676A

Abstract

The invention provides a distributed data storage management method, belonging to the field of computer communication. The method comprises the following steps: a data source layer generates data; the generated data enters a data processing layer; the data blocks produced by the data processing layer enter a data transmission layer; the data transmission layer packages the data blocks into data packets; the data packets are transmitted to a data storage layer over the SCSI (Small Computer System Interface) protocol or IP (Internet Protocol); and the data storage layer unpacks and stores the received data packets. The data storage layer consists of a plurality of sub-storage-array nodes, each of which stores one data block. Compared with the prior art, the method stores continuously growing user data in a distributed RAID (redundant array of inexpensive disks) storage array, which effectively reduces the data failure risk, shortens data recovery time, makes storage capacity easy to expand, lowers cost, and saves a large amount of maintenance cost.

Description

A distributed data storage management method
Technical field
The present invention relates to the field of computer communication technology, and specifically to a distributed data storage management method.
Background
With the explosive growth of data in the Internet era, the hard disk capacity that data centers use to store data keeps growing, and more and more data is stored on a single hard disk or a single disk array, yet the hardware failure rate of hard disk components has not decreased. This means that the failure risk per unit of data keeps increasing, data recovery takes longer and longer, and the reliability of user data faces an ever greater challenge. How to effectively reduce the data failure rate is a major problem facing computer communication technology.
Summary of the invention
The technical task of the present invention is to remedy the deficiencies of the prior art by providing a distributed data storage management method.
The technical solution of the present invention is realized as follows. The distributed data storage management method comprises the following specific steps:
1) data is generated by a data source layer, the data source layer being the user's application programs that produce the data;
2) the data generated in step 1) enters a data processing layer, which splits the file data into blocks; the data processing layer includes an EC (erasure code) algorithm control end that processes the original data;
3) the block data processed by the data processing layer enters a data transmission layer, which packages the block data into data packets and transmits them to a data storage layer over the SCSI protocol or the IP protocol;
4) the data storage layer unpacks the received data packets and stores them; the data storage layer consists of a number of sub-storage-array nodes, each of which is responsible for storing one data block.
In the above technical solution, data that a conventional architecture stores at a single physical location is decentralized by the relevant algorithm: an erasure code algorithm splits the file data into blocks, and the processed block data is then distributed, as SCSI or IP data packets, across multiple sub-array nodes.
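The flow from data source to storage nodes can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the patented implementation: `StorageNode`, `split_into_blocks`, `store` and `load` are hypothetical names, the blocks are produced by plain slicing rather than by an erasure code, and an in-memory dictionary stands in for a packet sent over SCSI or IP.

```python
from dataclasses import dataclass, field


@dataclass
class StorageNode:
    """Hypothetical stand-in for one sub-storage-array node."""
    name: str
    blocks: dict = field(default_factory=dict)  # block index -> payload

    def receive(self, packet: dict) -> None:
        # "Unpack" the packet and keep the payload, keyed by block index.
        self.blocks[packet["index"]] = packet["payload"]


def split_into_blocks(data: bytes, count: int) -> list:
    """Data processing layer (assumed): cut the data into `count` blocks."""
    size = -(-len(data) // count)  # ceiling division
    return [data[i * size:(i + 1) * size] for i in range(count)]


def store(data: bytes, nodes: list) -> None:
    """Data transmission layer (assumed): one packet per node."""
    for index, block in enumerate(split_into_blocks(data, len(nodes))):
        packet = {"index": index, "payload": block}
        nodes[index].receive(packet)


def load(nodes: list) -> bytes:
    """Reassemble the original data from the blocks held by the nodes."""
    return b"".join(node.blocks[i] for i, node in enumerate(nodes))
```

Calling `store(b"0123456789", nodes)` with three nodes leaves one block on each node, and `load(nodes)` concatenates them back into the original bytes.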
The sub-storage nodes of the data storage layer are physically independent of one another; that is, each sub-storage-array node has its own data redundancy protection method and can perform data protection, data backup and data recovery independently. In this technical solution the block data is stored across multiple sub-array nodes, each sub-array node stores only its own corresponding data, and the sub-array nodes are physically isolated from and independent of one another. The index data of the block data, generated by the algorithm, is used to logically order the sub-array nodes and manage them as logical volumes. Each sub-array has a certain data protection and recovery capability of its own, which effectively reduces the data failure rate.
In the configuration proposed by this patent, the sub-array nodes can be configured with different capacities and different RAID levels (such as RAID 0, RAID 1 and RAID 5). Each sub-storage-array node is configured according to the importance, volume and scale of the data it holds, so the cost of the server hardware configuration can be optimized and the advantages of storing across multiple sub-array nodes can be fully exploited, which effectively reduces the data failure risk while also optimizing cost.
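The trade-off between RAID levels can be made concrete with a small capacity calculation. The sketch below is illustrative only: `SubArrayConfig` and its fields are hypothetical names, all member disks are assumed to be the same size, and only the three RAID levels named in the text are handled.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SubArrayConfig:
    """Hypothetical description of one sub-array node."""
    name: str
    raid_level: int      # 0, 1 or 5, the levels named in the text
    disks: int           # number of equally sized member disks (assumed)
    disk_size_tb: float  # capacity of each member disk

    def usable_tb(self) -> float:
        if self.raid_level == 0:   # striping, no redundancy
            return self.disks * self.disk_size_tb
        if self.raid_level == 1:   # mirroring: every disk holds the same data
            return self.disk_size_tb
        if self.raid_level == 5:   # one disk's worth of capacity goes to parity
            if self.disks < 3:
                raise ValueError("RAID 5 needs at least three disks")
            return (self.disks - 1) * self.disk_size_tb
        raise ValueError(f"unsupported RAID level: {self.raid_level}")

    def tolerated_disk_failures(self) -> int:
        # Number of member disks that can fail without losing data.
        return {0: 0, 1: self.disks - 1, 5: 1}[self.raid_level]
```

With four 2 TB disks, RAID 0 yields 8 TB with no fault tolerance and RAID 5 yields 6 TB while surviving one disk failure, so a node holding critical data might reasonably be given RAID 1 or RAID 5 and a node holding easily regenerated data RAID 0.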
The data processing layer also includes a metadata server, which keeps a backup record of the index data.
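One way to picture the backup record is an index that is written twice, so that a lost primary copy can be rebuilt. The patent does not describe the index format; the `MetadataServer` class, its methods, and the "file name to block placement" layout below are all assumptions made for illustration.

```python
import copy


class MetadataServer:
    """Hypothetical index store that keeps a primary and a backup copy."""

    def __init__(self) -> None:
        # Assumed layout: file name -> list of (block index, node name).
        self._primary = {}
        self._backup = {}

    def record(self, file_name: str, placement: list) -> None:
        # Write the same entry to both copies so either can serve lookups.
        self._primary[file_name] = list(placement)
        self._backup[file_name] = list(placement)

    def lookup(self, file_name: str) -> list:
        # Fall back to the backup copy when the primary entry is missing.
        if file_name in self._primary:
            return self._primary[file_name]
        return self._backup[file_name]

    def restore_primary(self) -> None:
        # Rebuild the primary index from the backup record.
        self._primary = copy.deepcopy(self._backup)
```

A real deployment would keep the two copies on separate machines or disks; holding both in one process, as here, only demonstrates the lookup and restore logic.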
Compared with the prior art, the present invention has the following beneficial effects:
The distributed data storage management method of the present invention is designed to counter the growing risk of data failure. Through distributed data management, data that would originally be stored in a single RAID array is spread across multiple RAID arrays, and a distributed RAID storage array stores the ever-growing user data. This effectively reduces the data failure risk, shortens data recovery time, makes storage capacity easy to expand, lowers cost, and saves a large amount of maintenance cost.
Brief description of the drawings
Figure 1 is a schematic diagram of the layered data storage architecture of the present invention.
Detailed description
The distributed data storage management method of the present invention is described in detail below with reference to the accompanying drawing.
As shown in Figure 1, the distributed data storage management method consists of four parts:
Data source layer: the user's application programs produce the data.
Data processing layer: includes the EC algorithm control end, which processes the original data, and the metadata server, which archives the index data.
Data transmission layer: distributes the processed original-data blocks and parity-data blocks to the sub-storage-array nodes over the SCSI/IP protocol.
Data storage layer: each sub-storage-array node stores the data blocks it receives, completing the data storage protection.
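The hand-off between the data transmission layer and the data storage layer amounts to framing each block on one side and validating it on the other. The patent does not define a packet format, so the header below (magic bytes, block index, block kind, payload length, CRC-32) and the names `pack_block` and `unpack_block` are purely hypothetical; the resulting bytes would then be carried by whatever SCSI or IP transport is in use.

```python
import struct
import zlib

# Hypothetical header in network byte order:
# 4-byte magic, block index, block kind, payload length, CRC-32 of payload.
HEADER = struct.Struct("!4sIBII")
MAGIC = b"DSMP"
KIND_DATA, KIND_PARITY = 0, 1


def pack_block(index: int, kind: int, payload: bytes) -> bytes:
    """Transmission side: prepend the header to one data or parity block."""
    header = HEADER.pack(MAGIC, index, kind, len(payload), zlib.crc32(payload))
    return header + payload


def unpack_block(packet: bytes):
    """Storage side: strip the header and verify the payload before storing."""
    magic, index, kind, length, crc = HEADER.unpack_from(packet)
    payload = packet[HEADER.size:HEADER.size + length]
    if magic != MAGIC or len(payload) != length or zlib.crc32(payload) != crc:
        raise ValueError("corrupt or truncated packet")
    return index, kind, payload
```

The checksum lets a storage node reject a damaged packet before it reaches the array, which keeps transport errors separate from the disk-level protection that each sub-array provides on its own.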
The specific implementation steps are:
1. According to the user's actual application environment, that is, storage characteristics such as the number and capacity of the storage sub-arrays, generate the relevant algorithm parameters. Using these parameters, configure the blocking scheme of the erasure code (EC) algorithm in the data processing layer on the control server.
2. The data source layer on the control server obtains the original data to be stored from the user's application programs. The EC algorithm processes the original data into block data, parity data and index data.
3. The algorithm sets the amount of redundancy protection. For example, the original data can be divided into N data blocks and K parity blocks, M blocks in total (M = N + K). Any N of the M blocks are then enough to recover the original data, and the code rate is N/M = N/(N + K).
4. The data transmission layer of the control server encapsulates the processed block data and parity data into data packets and transmits them over the SCSI or IP protocol to multiple sub-storage-array nodes for distributed storage. Each sub-storage-array node is responsible for storing one data block.
5. Because the index data on the control server is critically important, the system keeps a backup record of the index data on the metadata server of the data processing layer, which facilitates subsequent data recovery, organization and querying.
6. After a sub-storage-array node receives a data packet, it unpacks it and stores the unpacked data in the storage array it independently controls. The array's own redundancy and error-correction features give the data a second layer of protection. Because each sub-storage array is a relatively independent storage unit with its own data protection capability, and each recovers its data independently, data recovery time after a hard disk failure is greatly reduced and a large amount of maintenance cost is saved.
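The redundancy arithmetic in these steps can be illustrated with the simplest erasure code: n data blocks plus a single XOR parity block, so k = 1. This is a swapped-in special case chosen for brevity; the patent does not name its EC algorithm, and tolerating more than one lost block (k > 1) would need a code such as Reed-Solomon. The function names are hypothetical.

```python
from functools import reduce


def xor_blocks(blocks):
    """Byte-wise XOR of equally sized blocks."""
    return bytes(reduce(lambda a, b: a ^ b, column) for column in zip(*blocks))


def encode(data: bytes, n: int):
    """Split data into n equal data blocks plus one XOR parity block (k = 1)."""
    size = -(-len(data) // n)             # ceiling division
    padded = data.ljust(size * n, b"\0")  # zero-pad the last block
    blocks = [padded[i * size:(i + 1) * size] for i in range(n)]
    return blocks + [xor_blocks(blocks)]


def recover(blocks, length: int) -> bytes:
    """Rebuild the data from n + 1 blocks where at most one entry is None."""
    missing = [i for i, block in enumerate(blocks) if block is None]
    if len(missing) > 1:
        raise ValueError("single parity tolerates only one lost block")
    if missing:
        present = [block for block in blocks if block is not None]
        blocks = list(blocks)
        # XOR of all surviving blocks equals the lost block.
        blocks[missing[0]] = xor_blocks(present)
    return b"".join(blocks[:-1])[:length]


def code_rate(n: int, k: int) -> float:
    """Fraction of stored blocks that carry original data."""
    return n / (n + k)
```

With n = 4 and k = 1, five blocks go to five sub-array nodes, any four of them rebuild the data, and the code rate is 4/5. The original length is passed to `recover` so the zero padding can be trimmed; in the scheme described above that value would naturally belong with the index data.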

Claims (3)

1. A distributed data storage management method, characterized in that its specific implementation steps are:
1) data is generated by a data source layer, the data source layer being the user's application programs that produce the data;
2) the data generated in step 1) enters a data processing layer, which splits the file data into blocks; the data processing layer includes an EC (erasure code) algorithm control end that processes the original data;
3) the block data processed by the data processing layer enters a data transmission layer, which packages the block data into data packets and transmits them to a data storage layer over the SCSI protocol or the IP protocol;
4) the data storage layer unpacks the received data packets and stores them; the data storage layer consists of a number of sub-storage-array nodes, each of which is responsible for storing one data block.
2. The distributed data storage management method according to claim 1, characterized in that the sub-storage nodes of the data storage layer are physically independent of one another; that is, each sub-storage-array node has its own data redundancy protection method and can perform data protection, data backup and data recovery independently.
3. The distributed data storage management method according to claim 1, characterized in that the data processing layer also includes a metadata server, which keeps a backup record of the index data.
CN2012104863468A 2012-11-26 2012-11-26 Distributed data storage management method Pending CN103051676A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN2012104863468A CN103051676A (en) 2012-11-26 2012-11-26 Distributed data storage management method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN2012104863468A CN103051676A (en) 2012-11-26 2012-11-26 Distributed data storage management method

Publications (1)

Publication Number Publication Date
CN103051676A true CN103051676A (en) 2013-04-17

Family

ID=48064170

Family Applications (1)

Application Number Title Priority Date Filing Date
CN2012104863468A Pending CN103051676A (en) 2012-11-26 2012-11-26 Distributed data storage management method

Country Status (1)

Country Link
CN (1) CN103051676A (en)

Citations (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101676855A (en) * 2008-09-11 2010-03-24 美国日本电气实验室公司 Scalable secondary storage systems and methods
CN101834899A (en) * 2010-04-29 2010-09-15 中科院成都信息技术有限公司 Distributed adaptive coding and storing method
US20110213928A1 (en) * 2010-02-27 2011-09-01 Cleversafe, Inc. Distributedly storing raid data in a raid memory and a dispersed storage network memory
CN102520890A (en) * 2011-12-30 2012-06-27 北京天地云箱科技有限公司 RS (Reed-Solomon) - DRAID( D redundant array of independent disk) system based on GPUs (graphic processing units) and method for controlling data of memory devices

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
顾瑜 等: "带重复数据删除的大规模存储系统可靠性保证", 《清华大学学报(自然科学版)》 *

Cited By (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN104683472A (en) * 2015-03-12 2015-06-03 浪潮集团有限公司 Data transmission method capable of supporting large data volume
CN106686117A (en) * 2017-01-20 2017-05-17 郑州云海信息技术有限公司 Distributed calculation cluster data storage processing system and method
CN106686117B (en) * 2017-01-20 2020-04-03 郑州云海信息技术有限公司 Data storage processing system and method of distributed computing cluster
CN106873911A (en) * 2017-02-10 2017-06-20 济南浪潮高新科技投资发展有限公司 The implementation method of Distributed Storage is carried out using Classfication of containers pattern
CN109241023A (en) * 2018-09-21 2019-01-18 郑州云海信息技术有限公司 Distributed memory system date storage method, device, system and storage medium
CN113448770A (en) * 2020-03-27 2021-09-28 伊姆西Ip控股有限责任公司 Method, electronic device and computer program product for recovering data

Legal Events

Date Code Title Description
C06 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20130417