WO2007021289A1 - Disaggregated resources and access methods - Google Patents

Disaggregated resources and access methods Download PDF

Info

Publication number
WO2007021289A1
WO2007021289A1 PCT/US2005/036022 US2005036022W WO2007021289A1 WO 2007021289 A1 WO2007021289 A1 WO 2007021289A1 US 2005036022 W US2005036022 W US 2005036022W WO 2007021289 A1 WO2007021289 A1 WO 2007021289A1
Authority
WO
WIPO (PCT)
Prior art keywords
resource
node
disaggregated
map
consumer
Prior art date
Application number
PCT/US2005/036022
Other languages
French (fr)
Inventor
Thomas Earl Ludwig
Mark Adams
Original Assignee
Zetera Corporation
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zetera Corporation filed Critical Zetera Corporation
Publication of WO2007021289A1 publication Critical patent/WO2007021289A1/en

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L61/00Network arrangements, protocols or services for addressing or naming
    • H04L61/45Network directories; Name-to-address mapping
    • H04L61/457Network directories; Name-to-address mapping containing identifiers of data entities on a computer, e.g. file names
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L61/00Network arrangements, protocols or services for addressing or naming
    • H04L61/50Address allocation
    • H04L61/5007Internet protocol [IP] addresses
    • H04L61/5014Internet protocol [IP] addresses using dynamic host configuration protocol [DHCP] or bootstrap protocol [BOOTP]

Definitions

  • the field of invention is disaggregated resources especially storage devices, memories, displays, CPUs, or other computing resources.
  • Such fine level granularity access is difficult in a disaggregated resource because a resource consumer does not necessarily have a sufficiently detailed and complete map of the disaggregated resource.
  • the organization of the resource nodes can be quite complex depending on a number of factors including type of resource involved, the roles or responsibilities of each resource node, resource node location, or other implementation specific information.
  • operating systems have to interpret the disaggregated resource as if it were locally connected in order to provide applications seamless, transparent integration with the computing environment.
  • BitTorrentTM a peer-to-peer file transfer system
  • BitTorrentTM offers redundant file storage as a distributed resource where each resource node is complete computer system offering file storage.
  • BitTorrentTM places redundant copies of data on multiple peers to alleviate some of the risk of an unreliable network; however, the peers are outside the control of the ultimate user so the user does not know if a peer is lost. Any owner of a BitTorreiitTM system can take down their computer reducing the overall performance for a remote unknown user.
  • each BitTorrentTM user has a different view of the network based on connectivity, a user can not, in a deterministic fashion, determine the extent and usability of the system.
  • BitTorrentTM is an application level protocol and does not provide a transparent solution of a storage resource that allows an operating system or application to read and write data at will. Cleary, BitTorrentTM offers some utility for high level file transfers; it is not suitable for consumer or enterprise system due to the lack of control, determinism, fine level access, or performance. BitTorrentTM and other peer-to-peer systems have not addressed the need for resource consumers to understand a complete map for a coherent disaggregated resource at an elemental level.
  • Hitachi's U.S. Patent No. 4,890,227 offers a resource management system for operating systems of large scale computers.
  • the memory, CPUs, I/O channels, and storage devices represent disaggregated resources.
  • the management system relies on a set of policies that are continually updated and deleted to equitably and autonomously assign resources to process units (programs, threads, or tasks).
  • the Hitachi patent does not address issues of resource coherency over an unreliable communication path or issues of multiple resource consumers (process units) having different views of the resources because a centralized management system handles all the resources for the process units.
  • the Hitachi patent does not provide insight on how a resource consumer will manage and access resource nodes of a disaggregated resource that extend beyond the core computer. Even though the Hitachi patent addresses more elemental resources, the centralized resource management system does not allow resource consumers build their own view of the system or to function independently. Furthermore, the Hitachi patent offers no insight how to address fine level of structure of a resource.
  • the Microsoft patent uses an underlying statistical assumption regarding the probability of a first peer knowing a second peer's ID based on the "distance" between the first peer's ID and the second peer's ID.
  • the Microsoft patent offers an efficient peer-to-peer name resolution system which allows a peer to keep track of and to find other peers by a useable organization scheme resulting in a map of the system.
  • the structure only offers a way to access peers across a peer-to-peer network, but does not offer developers a way to access a fine level of detail within a peer as required by a truly disaggregated resource.
  • the Microsoft patent does not address the need for a resource consumer to know the coherency of the disaggregated resource as resource nodes lose connectivity.
  • Adaptec's U.S Patent No. 6,922,688 offers a method of accessing data objects where portions of the object are found through obtaining referential maps comprising logical storage locations and physical maps comprising the physical storage locations associated with the logical storage locations.
  • the patent teaches how to access data objects distributed across a plurality of physical locations, it does not enable disaggregated resources, how to access such resources, how to operate disaggregated resources as one functional whole resource, or how to maintain a disaggregated resource over an unreliable network.
  • one aspect of the present invention comprises a split map, that map is a split map of disaggregated resources not a split map of data objects.
  • the term resource excludes data objects.
  • a disaggregated resource would comprise independent, efficient resource nodes that do not necessarily communicate with each other and do not require out-of-band communications •
  • the resource nodes provide information about their role in the disaggregated resource to resource consumers that request the information
  • Resource consumers construct their own view of the disaggregated resource based on information from the resource nodes
  • a map of the resource provides access to a fine level of granularity to the resource at or below the resource device level
  • the present invention is directed toward disaggregated resources that are spread among a number of resource nodes.
  • the resource nodes provide information about the nodes to resource consumers who then create maps of the resource in order to properly access the resource.
  • the resource node information from a single resource node comprises insufficient information for a useable map of the disaggregated resource.
  • Resource node information from multiple resource nodes allows for the creation of a more complete, useable map from perspective of a resource consumer.
  • Another aspect of the invention provides for resource maps to be split among elements of the disaggregated resource and resource consumers in order to provide fine level access to the resource without the resource consumer storing an entire map of the resource.
  • Another aspect of the invention is directed toward methods for accessing the disaggregated resource through organizing resource nodes into the disaggregated resource, assigning resource node information, and assessing if the disaggregated resource has coherency.
  • the resource nodes respond to discovery messages by providing their resource node information from which the resource consumers construct a map on how to access the resource.
  • Resource consumers that wish to access the resource include resource consumers, resource managers, or other interested parities.
  • resource node information includes resource name, resource node name, resource node type, or resource attributes.
  • Resource consumers that interact with disaggregated resources use the resource name to identify a particular resource.
  • each resource node that composes the disaggregated resource includes a resource node name to differentiate themselves in cases where the nodes offer redundant physical resources. Through the use of names, the resource consumers select which resources are of interest and which resource nodes compose the disaggregated resource.
  • a resource node type provides resource consumers with the necessary information on how to access a specific node within the disaggregated resource.
  • Resource nodes function independently of each and focus only on their prescribed roles and responsibilities.
  • resource node attributes provide the resource consumers information on node state with respect to the disaggregated resource if resource state is critical relative to other nodes and provide resource node role information.
  • Contemplated attributes include coherency attributes that indicate when a disaggregated resource has lost coherency. If a disaggregated resource loses coherency, then one or more resource nodes have lost contact with active resource consumers; consequently, the disaggregated resource might not operate correctly or might not offer correct services.
  • Resource node information from an individual resource node comprises incomplete information for resource consumers to interact with the complete disaggregated resource.
  • a resource consumer collects resource node information from resource nodes to create a map of the disaggregated resource.
  • the resource map can be the same from one resource consumer to another or can be different from one resource consumer to another depending on what the resource consumers requires and how the disaggregated resource is configured. Furthermore, the resulting resource map has sufficient detail to allow a resource consumer to access the disaggregated resource.
  • the resource map provides a translation of a resource element ID to a physical resource address, location, or capability. It is specifically contemplated that the resource map splits into a number of resource sub-maps where each sub-map provides a further translation.
  • a resource consumer accessing a disaggregated resource creates a top level resource sub-map that comprises a translation of a resource element ID to an address of a resource node.
  • Each resource node also comprises a resource sub-map that translates a resource element ID to a physical resource ID. Consequently, it is contemplated that a resource map has a granularity below a resource unit level to a partition that represents some fraction of a physical resource. In addition, it is specially contemplated that a resource map has granularity below a partition level to an individual addressable physical resource element level, or even smaller.
  • Contemplated disaggregated resources include, but not limited to, computer related resources including storage, displays, CPU, input devices, or memory. Non-computer related resources are also contemplated including electrical power, or networking bandwidth. Specifically contemplated examples include disaggregated storage or disaggregated video displays. Furthermore, a disaggregated resource could comprise a heterogeneous mix of physical resources. For example, a number of disk drives, displays, CPUs, or memory where each individual item is disaggregated, then forms together into a complete disaggregated computer system.
  • Access Fault means an error condition raised by a resource node when a resource consumer has requested access to a resource beyond the capabilities of the resource node. When the consumer receives the access fault, the consumer reconsiders the request and then forms a more reasonable request, or takes other desired actions.
  • Resource means a fungible commodity required or desired by a resource consumer for the consumer's operation.
  • Resources include displays, I/O channels, data storage, network bandwidth, processor bandwidth, or other items used by a resource consumer even those that pertain to other systems beyond computing, for example electrical power.
  • Physical Resource means a physical instantiation of a resource at the device level. Examples of physical resources include video displays, hard disk drives, memory, processing units, media cards, network interfaces, power outlets, or other items used by resource consumers.
  • Resources have granularity; therefore, "resource partition” means a logical representation of a portion of a physical resource. For example, a resource partition of a hard disk drive includes a logical partition on the disk that is addressable external to the disk.
  • resource partition includes a range of pixels in an (x,y) coordinate system on a video display representing a logical frame that is addressable external to the display. Further granularity is possible; therefore, "resource element" means a natural, logical unit of the resource that can be addressed.
  • An example of a resource element for a storage system based on a hard disk drive includes a data block on the disk.
  • Yet another example of a resource element includes a pixel of a video display.
  • resource does not include a file or other data object because data objects are not fungible.
  • Resource Consumer means an entity that utilizes a commodity in order to deliver its desired functionality. Resource consumers have the ability to operate independently of other resource consumers. Further, resource consumers share physical resources though each resource consumer could have a completely different perspective of the resource. Example resource consumers include applications, computers, operating systems, file systems, users, management software, or other entities that have interest in the commodity.
  • Resource Map means a translation from a virtual addressing scheme to a physical addressing scheme of a physical resource.
  • Resource maps comprise software or data that translates a resource element ID into a physical address.
  • Resource maps can split into a number of resource sub-maps where each sub-map comprises a further translation to a final physical address.
  • Resource maps offer varying degrees of granularity of mapping virtual resource elements to physical locations. For example, a network storage system can offer a resource map with a granularity at the disk level, at the partition level, at the data block level, or even below the data block level by employing extra operations.
  • Split Resource Map means a resource map split into a plurality of resources sub-maps. The resource sub-maps reside in individual memories where necessary.
  • Resource Node means a logical construct executing on a processor comprising software or firmware that provides a representation of a resource partition.
  • Resource nodes comprise resource node information including and resource maps. Resource nodes have the ability to function independently of other resource nodes. This implies resource nodes do not require out-of-band management to communicate with other resource nodes in order to deliver a coherent disaggregated resource.
  • An example of a resource node is a combination of hardware, software, or firmware that functions as a logical partition of a disk. The logical disk partition accepts read or write requests based on a logical block address (LBA) of a data block, and then performs the requested operation on the disk.
  • LBA logical block address
  • Yet another example includes a combination of hardware, software, or firmware that functions as a logical video frame.
  • the logical video frame similarly accepts pixel display commands then instructs a monitor to modify the pixel within the bounds of the logical frame.
  • Resource nodes comprise a resource map to aid in the translation of a resource element ID, including an LBA or logical (x,y) coordinate of a pixel, to a physical address, a disk sector, track, cylinder, or (x,y) coordinate on a monitor.
  • Resource Node Attribute means an attribute within a resource node used by resource consumers external to the resource node to determine if the disaggregated resource suffers from a potential problem or to determine the resource node's role.
  • Resource node attributes include coherency attributes. If a coherency attribute or collection of coherency attributes indicates a disaggregated resource has lost coherency, a resource consumer can then determine the best alternative steps to deal with a potential problem.
  • Disaggregate resources access methods can be used to manage large arrays of storage devices, arrays of video displays, or other resources that lend themselves to disaggregation.
  • Figure 1 is a schematic of a disaggregated resource.
  • Figure 2 is a schematic of a possible physical embodiment for a resource node.
  • Figure 3 A is a schematic of an example of a parallel group of resource nodes composing a disaggregated resource.
  • Figure 3B is a schematic of an example of a serial group of resource nodes composing a disaggregated resource.
  • Figure 3C is a schematic of an example of a combination group of serial and parallel groups of resource nodes composing a disaggregated resource.
  • Figure 4A is a schematic of an example of group types for a disaggregated storage array.
  • Figure 4B is a schematic of an example of group types for a disaggregated video display.
  • Figure 5 A is a schematic for an example of a physical arrangement of logical disk partitions composing a disaggregated storage array representing a logical volume Z.
  • Figure 5B is a schematic for an example of the logical organization of logical disk partitions composing a disaggregate storage array representing the logical volume Z of Figure 5A.
  • Figure 6 is a schematic for a resource map.
  • Figure 7 is a schematic for a split resource map.
  • Figure 8 illustrates resource maps with varying degrees of granularity.
  • Figure 9 is a schematic for an example method of accessing a disaggregated resource.
  • the first example embodiment focuses on a disaggregated storage array comprising resource nodes in the form of logical disk partitions and the second embodiment focuses a disaggregate video display comprising resource nodes in the form of logical video display frames.
  • Figure 1 presents a schematic for a disaggregated resource comprising more than one resource node.
  • Disaggregated resource 100 comprises resource nodes HOA through 11OE. Although five resource nodes are shown, the number of resource nodes varies as necessary to fulfill the requirements of the disaggregated resource.
  • Resource consumers 140A through 140N communicate with disaggregated resource 100 through communication path 150 by communicating with resource nodes HOA through 11OE.
  • resource consumers 140A through 140N take the fomi of an electrical device running a driver on a processing unit.
  • the driver allows an application or applications running on the electrical device to interact with the resource provided by disaggregated resource 100 and resource nodes HOA through 11 OE in a manner that is transparent to the applications. Consequently, disaggregated resource 100 appears as a local resource from the perspective of the applications.
  • resource consumers include computers, operating systems, file systems, management software, or other entities that require access to disaggregated resource 100.
  • Resource consumers 140A through 140N interact with disaggregated resource 100 independently from each other. Therefore, each resource consumer has its own view of disaggregated resource 100. Resource consumers 140A through 140N can alter resource node information stored on resource nodes 11 OA through 11 OE. It is contemplated that altering resource node information provides for indicating changes in the disaggregated resource.
  • Communication path 150 provides for a sustained communication between resource consumers 140A through 140N and resource nodes HOA through 11 OE. It is specifically contemplated that communication path 150 is unreliable and could result in one or more of the resource nodes 11OA through 11OE losing connectivity with resource consumers 140A through 140N.
  • Contemplated communication paths include packet switched networks, computer busses, wireless transmission, or other communication mechanisms.
  • Figure 2 presents a schematic for a possible embodiment of a resource node.
  • Resource node 200 comprises processing unit 210 and memory 220.
  • Processing unit 210 communicates with other entities, including resource consumers, over communication path 250.
  • processing unit 220 communicates with physical resources 260A through 260M over physical resource communication path 215.
  • Processing unit 210 communicates with memory 220 over bus 225.
  • Memory 220 further comprises resource node information 230 and resource map 240.
  • Memory 220 comprises resource node instructions that run on processing unit 210, and data for use by processing unit 210 or resource consumers.
  • Data structures include resource node information 230, or resource map 240.
  • Resource node information 230 includes one or more information elements as indicated by resource info ⁇ nation elements 233A through 233N and conveys information regarding resource node 200 to resource consumers over communication path 250.
  • Contemplated examples of resource node information include resource name, resource node type, resource node name, resource node attribute, address, or other resource node information of interest to a resource consumer.
  • Resource consumers use resource node information to determine the role of a specific resource node within a disaggregated resource, amount other tilings.
  • Resource map 240 provides processing unit information on how to translate requests for resource access from resource consumers into an access of physical resource 260A through 260M. It is contemplated that resource map 240 provides information on translating resource consumer requests across one or more physical resources, at a partition level of a physical resource, or at levels smaller than a partition. Resource node 200 uses resource map 240 to aid in determining the responsibility of resource node 200 with regards to physical resources 260A through 260M.
  • Contemplated embodiments of resource node 200 include modules comprising a combination of hardware, software, or firmware. Modules in the form of adapters plug into physical resource devices to provide resource node capabilities. Furthermore, adapters comprising embedded boards, chips, ASICs, FPGAs, or other components integrate into a larger physical resource device to provide resource node capabilities, hi addition, modules include enclosures that house one or more physical resource devices. Such modules include operating systems and communication stacks as firmware infrastructure to support resource node functionality. Because modules can comprise software, it is also contemplated software programs or drivers install on computers or other computer-like systems to provide resource node capabilities for the individual resources within the computer.
  • Contemplated software or firmware executing on processing unit 210 that provides resource node 200 with its functionality including monolithic code that supports multiple resource nodes, collections of functions operating together, or individual tasks or threads.
  • Resource node 200 operates as a virtual structure that presents a logical representation of a portion of physical resource 260A through 260M to resource consumers.
  • Contemplated logical representations include logical partitions of disk drives or logical video frames of a monitor. The logical representations combine to form the complete disaggregated resource, which itself is a logical representation of a resource.
  • resource node 200 exists in a module that supports multiple other resource nodes, it is contemplated all resource nodes have unique addresses.
  • each resource node acquires an address from external to the enclosure, from a DHCP server for example.
  • multiple resource nodes share a common physical address, an Ethernet MAC address, when obtaining an address by using unique virtual physical addresses when communicating with the address server.
  • an enclosure can obtain multiple IP addresses from a DHCP server by using virtual physical addresses for each resource node in the "chaddr" field, the client address field, of the DHCP client request even though the module has a single real MAC address. This approach alleviates issues where a DHCP server does not correctly implement the DHCP client ID properly.
  • a resource node comprises the ability to function independently of all other resource nodes and offers resource consumers access to the physical resource for which the resource node is responsible without regard to all other resource nodes composing the disaggregated resource. Consequently, resource nodes combine together forming different structures based on how a group as a whole is intended to function.
  • there are four types of group structures a group of one operating as a complete resource, a parallel group structure where resource nodes provide redundant resources, a serial group structure where resource nodes provide no redundancy, or a combination of parallel and serial groups.
  • a single resource node that operates as a group of one providing a single complete resource is a trivial case and does not represent a disaggregated resource.
  • Figure 3 A illustrates an example of a parallel group of four resource nodes.
  • Disaggregated resource 300 comprises resource nodes 305 A through 305D and represents a single resource from the perspective of resource consumer 310.
  • Resource consumer 310 interacts with disaggregated resource 300 through communication path 315.
  • Resource nodes 305A through 305D offer the same fundamental resource, resource "A," to resource consumer 310. Therefore, if one of resource nodes 305 A through 305D disappears from view of resource consumer 310, resource consumer 310 still is able to fully utilize disaggregated resource 300 even though disaggregated resource 300 has lost a resource node because the remaining nodes offer redundancy.
  • An example of a parallel group includes a data storage array employing a number of mirrored disks.
  • a parallel group includes a video display comprising multiple monitors displaying the same video data. Should one monitor fail, the remaining monitors continue to provide the video data.
  • a parallel group comprises two or more logical disk partitions that mirror data from another group of one or more logical disk partitions.
  • Figure 3 A presents an example of four resource nodes, it is contemplated that a parallel group can comprise one or more resource nodes. It is further contemplated that a parallel group can comprise one or more other groups. Other contemplated parallel groups include archives or backups where data is stored for long term storage.
  • Disaggregated resource 320 comprises resource nodes 325A through 325D.
  • Disaggregated resource 320 appears as a single resource to resource consumer 310.
  • Resource consumer 310 communicates with disaggregated resource 320 over communication path 315.
  • Resource node 325 A through 325D each offer a single piece of a resource, "A" through “D” respectively, where resource consumer 310 requires resource "ABCD" to operate properly.
  • Resource node 325 A through 325D offers no redundancy; therefore, if one of the nodes disappears from view of resource consumer 310, disaggregate resource 320 would no longer be usable.
  • serial groups for storage arrays include two or more disks comprising RAID-O stripes or two or more disks where the disks are spanned together to offer larger capacity. If one disk fails, the storage array is rendered useless until the array is fixed.
  • Other contemplated serial groups including a parity group, or an individual group where a parity group represents a group that comprises parity information and where an individual represents a group of a single resource node.
  • An alternative example of a serial group includes a composite video display comprising two or more monitors that are combined into a single larger display. If one monitor fails, then the complete display is not useful because some video information is missing.
  • Figure 3C illustrates an example of a combination group comprising a parallel group of two redundant serial groups.
  • Disaggregated resource 350 comprises serial group 330 and serial group 340, each offering the same resource "ABCD" to resource consumer 310.
  • Resource consumer 310 communicates with disaggregated resource 350 over communication path 315.
  • Resource nodes 335A through 335D provide access to resource "A" through “D” respectively as do resource nodes 345A through 345D.
  • An example of a combination group includes a RAID 10 storage array where data is striped across multiple disks and those disks are mirrored. It a preferred embodiment, combination groups comprise two or more logical disk partitions. Furthermore, in a preferred embodiment combination groups comprise one or more other groups.
  • One or more resource nodes combine to form a group.
  • One or more groups form an additional group.
  • a disaggregated resource is a single top level group comprising one or more other groups. Groups become complex as resource nodes combine to offer various capabilities and services and because each resource node operates independently other nodes. Resource consumers require information to understand how resource nodes combine to form a disaggregated resource in order to properly interact with the disaggregated resource and to determine if the disaggregated resource has coherency.
  • groups are classified by types where a type specifically references a role played by a collection of resource nodes. Furthermore, it is contemplated that resource consumers access a group collectively in addition to accessing individual resource nodes within the group, hi the preferred embodiment, resource consumers access the group collectively through multicast messages.
  • Resource consumers obtain a view of a disaggregate resource based on information about resource nodes.
  • Resource nodes are unaware of their role in a larger disaggregated resource because they focus on their main responsibility of providing access to a physical resource. Consequently each individual resource node lacks information about the complete disaggregated resource.
  • Resource consumers obtain resource node information from each of the individual resource nodes and combine the information to form a description of the disaggregated resource in order to properly interact with it.
  • resource node information resides within in a memory associated with the resource node; however, it is contemplated the resource node information can reside in alternative memories.
  • a disaggregated resource comprises only a serial group
  • a resource consumer only needs sufficient information to dete ⁇ nine if the serial group has coherency, hi other words, the resource consumer only needs to know if members of the group are missing. If the serial group does not have coherency (members of the serial group are missing), the disaggregated resource is not useable. Therefore, a resource consumer does not need complete information, i.e. information about which members are missing, but rather sufficient information indicating that members are missing.
  • a disaggregated resource comprises parallel groups
  • the resource consumer only requires sufficient information to determine if, through the redundancy of the parallel groups, the disaggregated resource is usable. Again, the resource consumer only needs sufficient information rather the complete information regarding the disaggregated resource.
  • Resource node information comprises elements that include the group type of a group of which a resource node is a member, resource node attributes, resource name, resource node name, resource node address, or other information desired by resource consumers to determine the coherency of a disaggregated resource. Resource consumers also use resource node information to determine the role of each resource node. If a resource consumer knows the role of a resource node, then the resource consumers is able to partially assist the resource node by performing operation in anticipation of passing resource requests to a resource node. For example, if a resource node represents a RAED-O stripe, a resource consumer could perform necessary modulo arithmetic based on an LBA to determine if the stripe should receive a specific data block.
  • Resource nodes combine together to form groups of different types. Each type of group represents a desired functionality, capability, or service. A group type provides resource consumers partial information regarding the role a group plays in the disaggregated resource.
  • Figure 4A presents a set of possible group types for resource nodes composing a disaggregated storage array.
  • the resource nodes are logical disk partitions in this example. Assume the storage array comprises a data set of sequential data blocks "ABCDEFGHI" that could represent a single file.
  • the resource map of each logical disk partition determines for which data blocks the logical partition is responsible.
  • Logical disk partitions 410A and 410B are mirrors because they both are responsible for identical data and provide redundancy; therefore, each logical disk partition has a "mirror" group type.
  • each logical disk partition is a group of one node in the example shown.
  • Mirror groups are a form of parallel group and have resource that is redundant to another group.
  • Logical disk partitions 420A through 420C represent a stripe group where individual data blocks are written across logical disk partitions 420A through 420C sequentially, then wrap around.
  • Logical disk partitions 430A through 430B represent a span group where data fills one logical disk partition then flows over to the next logical disk partition.
  • Both a stripe group and a span group are serial groups, hi a preferred embodiment, contemplated group types include mirrors, stripes, or spans.
  • contemplated mirror subtypes include snap-shots, backups, or deferred mirrors; contemplated stripe sub-types include parity. Snap-shots are a type of mirror group that captures a dataset at a specific point in time.
  • Backups are a type of mirror group that archives a data set.
  • Deferred mirrors are a type of mirror that has an I/O latency with respect to a primary group.
  • Parity is a type of stripe comprising parity information. Parity groups are useful when implementing RAID-3, 4, 5, 6, or other storage arrays that require parity.
  • Figure 4B presents another example of a set of possible group types for resource nodes composing a disaggregated video display.
  • the resource nodes are logical video frames in this example.
  • Logical video frames 440A and 440B represent mirror groups where the same data is displayed on the two logical frames.
  • Logical video frames 450A through 450F compose a composite display and are therefore a composite group.
  • the logical video frames 440A through 440F do not have to be located on the same monitor or have to be located physically close to each other.
  • group types are created based on the type of disaggregated resource.
  • group types regardless of the type of disaggregated resource, mirrors for example.
  • Composite types are essentially a combination of span and stripe. Groups can become quite complex depending on how the different group types combine to from a disaggregated resource.
  • groups are organized according to rules for manageability providing resource consumers structure so they can determine how groups should fit together and what role each group takes on.
  • group types are hierarchically organized. For example in storage arrays it is contemplated that mirror groups are the top most group because each mirror represents one data set. The next level comprises stripe groups that provide performance enhancement, and then at the lowest level are span groups that provide capacity enhancement.
  • group type information is encoded in a memory.
  • the group type information combines with a resource node name.
  • group types also represent results of operations performed on other groups. For example in a disaggregated storage array, if a stripe group of logical disk partitions has a "mirror operation" performed on it, the result is a mirror group comprising two stripe groups, the original stripe group and a new stripe group.
  • Resource node names provide one or more pieces of information to remote consumers including identification or group membership information, logical position information within a disaggregated resource, partial role information, or other information that a resource consumer might need to access a disaggregated resource.
  • Identification and membership information allows a resource node to uniquely identify itself with respect to other resource nodes whether they are in the same disaggregated resource or from other disaggregated resources.
  • Resource consumers can resolve the resource node name to an address which allows the resource consumer to communicate directly with a specific resource node.
  • Resource node names and addresses ensure each resource node is distinguishable from other resource nodes.
  • a resource node comprises the name of the disaggregated resource.
  • a resource node name comprises a unique identifier including a UUID or a GUID.
  • a resource node name of a preferred embodiment resolves to an internetworking address including a unicast address or a multicast address.
  • a resource node name comprises position information that informs a resource consumer where in a disaggregated resource a resource node fits.
  • the position information comprise a group type. Therefore, a resource node name carries partial resource node role information.
  • Figure 5 A provides an example of a physical structure of a possible disaggregated storage array comprising a number of resource nodes where the resource nodes are logical disk partitions.
  • the example is provided to clarify how resource consumers can utilize resource node names.
  • a resource consumer operates on data set comprising data blocks "ABCDEFGHI" stored on disks 501 through 505.
  • the storage array comprises three complete, redundant data sets for reliability.
  • Each disk comprises one or more partitions controlled by a resource node that represents each partition as an addressable, logical disk partition.
  • Resource node 510 is a mirror partition comprising the complete data and is; therefore, a mirror group of one resource node.
  • Mirror group 520 also comprises the complete data stored on three striped resource nodes 522, 524, and 526.
  • Mirror group 530 also comprises the complete data set stored on two striped resource nodes 532 and 534 and span group 535 that functions as a third stripe.
  • Span group 535 comprises resource nodes 537 and 539. From the perspective of a resource consumer, this collection of resource nodes appears as a single logical volume that stores data set "ABCDEFGHI.” The resource consumer attempts to make sense of the structure through each resource node's name.
  • Figure 5B presents a logical representation of the same disaggregated resource of Figure 5 A assuming groups are organized hierarchically as follows from top to bottom: mirrors, stripes, and spans.
  • Logical volume 500 comprises the groups to fo ⁇ n a single disaggregate resource.
  • Table 1 lists a possible name associated with the eight nodes composing logical volume 500.
  • Table 1 assumes logical volume 500 has name "Z” and that mirror groups are designated by the letter “M” and a number, hi addition, stripe groups are designated by the letter “R” and a number. "R” is used in the example to reflect stripes are a RAID 0 group. Span groups are designated by the letter "S.”
  • Resource node names provide a resource consumer enough information to understand the structure of the disaggregated resource represented by logical volume 500; however, the naming structure only partially describes the role played by each resource node.
  • resource node 532 represents mirrored data as indicated by the portion of the name "M2;” however, there is no indication that it is also a stripe.
  • resource node 534 its name carries sufficient information to describe the node's role.
  • Table 1 presents one contemplated naming scheme. All other naming schemes that provide structure are contemplated.
  • each resource node name reflects its initial membership of a group in a hierarchical fashion based on group type.
  • root node 510 and resource node 522 are established with names "Z" and "Z.M1" respectively to create two mirrors of each other.
  • additional groups and nodes including striped nodes, to add further functionality it is preferable to leave established names as they are rather than renaming nodes to reflect modifications without having to rename all nodes which would require intensive bookkeeping.
  • resource consumers determine the overall structure of a disaggregated resource and identify gaps in the structure. For example, if resource node 524 does not respond to a discovery request by a resource consumer, the resource consumer understands there is a gap between resource node 522 with name "Z.M1" and resource node 526 with name "Z.M1.R2.” Therefore, the resource consumer has sufficient information to dete ⁇ nine that mirror group 520 (as serial group of striped resource nodes) is incomplete and can not be used.
  • a resource consumer can not know the last member in a group because it is always possible that the last resource node might not have responded.
  • resource node 526 of mirror group 520 does not respond to a discovery request, the resource consumer would believe that resource node 524 is the last member of mirror group 520 and would not know that group is incomplete. It is contemplated that resource node names carry complete information regarding group memberships so that resource consumers can make a full determination of disaggregated resource coherency; however, such naming is inefficient because names can become long, disorganized, or unmanageable. It is also contemplated that resource node names have additional characteristics including being discoverable via discovery request message, or comprise information regarding resource node responsibilities.
  • Resource node names form two different classes: those that provide resource node role information and those that do not. Resource node role information helps resource consumers determine how to properly interact with individual resource nodes. For example in the case of a mirror group 520, resource nodes 522, 524, and 526 are striped nodes. A resource consumer can infer through the names of resource nodes 522, 524, and 526 are each stripes. Therefore, based on knowledge of how stripes operate (stripe depth, LBA ranges, module arithmetic, or other information), a resource consumer can determine which resource node is responsible for individual LBAs in a logical volume. Consequently, the resource consumer performs LBA calculations locally to determine which resource node is responsible for an LBA, then directly interacts with an individual resource node.
  • the interaction is through a unicast message directed from the resource consumer to the address of a resource node. If resource nodes handle LBA calculations, then resource node names only have to carry organization information and do not have to carry resource node role information. In this case, a resource consumer directly interacts with the entire group collectively and lets each member of a group determine if any action is required on their part. For example, a resource consumer interacting with mirror group 520 sends a single message comprising an LBA to resource nodes 522, 524, and 526 collectively. It a preferred embodiment, the interaction is through a multicast message directed from the resource consumer to the multicast address of the group. Given this case, developers can use any naming or tagging system that results in a discemable structure for a disaggregated resource and is understandable by a resource consumer.
  • Resource node attributes contribute to determining the role of a resource node, to determining the coherency of the disaggregated resource, or to helping a resource consumer to properly interact with a disaggregated resource.
  • Resource consumers use resource node attributes to determine if a serial group of resource nodes is complete or has coherency. It is contemplated that resource nodes have coherency attributes where each attribute is associated with a group type. For example, in a storage array representing a disaggregated resource with mirror groups, stripe groups, and span groups, coherency attributes could include a mirror attribute, a stripe attribute, and span attribute, hi a preferred embodiment, each coherency attribute indicates that there are additional members, and therefore, nodes, associated with a particular group. Table 2 illustrates how coherency attributes contribute to a resource consumer's understanding of a disaggregated resource structure and the roles each node plays in the disaggregated resource based on logical volume 500 example presented in Figure 5B.
  • the "M” attribute indicates that there is an additional mirror group; the "R” attribute indicates that there is an additional stripe member, and the "S” indicates there is an additional span member.
  • the attributes are set on the individual nodes, the attributes are associated with the groups because groups can comprise other groups that further comprise multiple resource nodes, for example mirror group 530 comprising span group 535 which functions as a stripe.
  • a resource consumer interacting with logical volume 500 with attributes has sufficient information to determine if the serial groups comprising logical volume 500 have coherency and are complete. For example, the resource consumer can determine the completeness of mirror group 520, a serial group comprising striped resource nodes 522, 524, and 526, by checking the attributes even if a resource node is not visible. If resource node 526 is not visible to the resource consumer, then the resource consumer can determine that there are additional expected groups based on the "R" attribute being set on resource node 524.
  • the resource consumer can determine that there are no additional members because the "R" attribute is not set on resource node 526 indicating there are no additional stripes and, therefore, can determine that the group is complete.
  • the attributes also contribute information that allows a resource consumer to infer what role a resource node plays in the disaggregated resource.
  • resource node 522 has the name "Z.M1" which partially indicates its role, that it is a mirror. However, the name does not contain information regarding its role as a stripe.
  • the resource consumer can infer that resource node 522 is stripe by detecting that the "R" attribute is set indicating that an additional stripe is expected implying resource node 522 is also a stripe.
  • Coherency is important when multiple resource consumers access a shared disaggregated resource where the physical resource has state from the perspective of the resource consumers. Due to a number of reasons including unreliable networks, a resource consumer could loose track of a resource node in a parallel structure; consequently, that node could become out of synch with other resource nodes even though the complete disaggregated resource still provides functionality from the perspective of the resource consumer due to redundancy.
  • logical volume 500 is a parallel group comprising three mirror groups, root node 510, mirror group 520 (a serial group of striped resource nodes), and mirror group 530 (a serial group of striped and spanned resource nodes) all comprising identical data sets. If a first resource consumer can see only mirror groups 520 and 530, but not 510, then it could properly interact with the two visible groups. However, if a second resource consumer can see only mirror group 510, but not 530 and 520, then the second resource consumer can properly interact with group 510. Unfortunately, data on logical volume 500 is at risk if both resource consumers write to the volume because the volume no longer has coherency resulting in the groups losing synchronization.
  • Coherency attributes can include a count of the number of groups or resource nodes comprising a parallel group. In the case of logical volume 500, there are three parallel groups supplying redundant data; therefore, the member count is three. Furthermore, a resource consumer, upon detection that a group or node is missing, can change a valid count on the visible resource nodes to indicate the how many groups are valid. For example, in the case of the first resource consumer seeing only mirror group 520 and 530 would set a valid count to two (as opposed to three). Any other resource consumer would see a discrepancy between the valid count (2) and member count (3) and know the disaggregated resource has lost coherency.
  • resource nodes have both a resource valid attribute indicating how many members of a group are valid and a resource member attribute indicating the number of expected members in the disaggregated resource.
  • resource nodes comprise a key to ensure that only one resource consumer is able to modify resource attributes at a time, or to indicate when group member resource node attributes are modified.
  • Contemplated keys include random numbers, sequential counts, UUID, or GUIDs.
  • resource consumers periodically check the coherency of a disaggregated resource to ensure it is able to services resource requests.
  • through an advantageous selection of rules for creating an organizational structure of resource nodes only one attribute is required for each group type rather than an attribute for each created group.
  • Additional resource node attributes are also contemplated. Specifically, attributes that indicate whether resource nodes are owned or share provide utility when multiple resource consumers interact with a disaggregated resource simultaneously.
  • resource node capabilities include resource node capabilities, physical resource information, detailed resource node resource map information, or other information that enhances the operation of a disaggregated resource with respect to a resource consumer.
  • resource node capabilities within storage array include block size, stripe size, logical partition capacity, or LBA responsibility information.
  • resource node capabilities within a disaggregated video display include logical video frame size, pixel color depth, decoding information, or aspect ratio. All information regarding a resource node is contemplated.
  • Resource maps provide a translation from a resource element ID, an LBA or logical pixel coordinate for example, to a physical address, a hard disk storage block or a pixel on a video monitor for example.
  • physical address means an address that is directed toward a physical resource; therefore a physical address includes intermediary addresses that are further directed toward the physical resource. Resource maps, especially those located within a resource node determine a resource node's responsibility.
  • Resource maps are either static or dynamic. Static resource maps are stored in a memory and contain sufficient information for translation. Dynamic resource maps are also stored in a memory and are built based on resource node information. Resource consumers build a dynamic map, if necessary, and determine if the map is complete through the resource node information.
  • Figure 6 depicts an example of a resource map. Resource consumers access a disaggregated resource through the use of resource map 600 by referencing elements of the disaggregated resource via resource element IDs 610A through 610N. Resource map 600 includes a translation from resource element IDs 610A through.610N to physical addresses 620A through 620N. A resource map comprises a combination of software or data structures to achieve the goal of translation.
  • resource map 600 comprises a software function that translates resource element IDs 610A through 610N to physical addresses 620A through 620N. Consequently, resource maps provide a translation from a virtual coordinate system used to represent the disaggregated resource to a physical coordinate system across physical resources.
  • a resource consumer collects resource node information from resource nodes that compose a disaggregated resource.
  • the resource consumer then constructs a dynamic resource map in memory based on resource node names, resource node attributes, resource node addresses, or group types. If the resource consumer has an understanding of the roles of each resource node, then the resource consumer can have an understanding of how resource element ID 610A through 610N are distributed across all the resource nodes.
  • a resource map takes the form of software function employing modulo arithmetic to determine which resource node is responsible for an LBA. The physical address in this case would be the address of the node responsible for the LBA.
  • a resource element ID translates to one or more physical address in order to support parallel groups including mirrors.
  • the complete map of a disaggregated resource splits into a number of smaller maps; each of them also a resource map.
  • Resource consumers need sufficient information to create a map that allows it to interact with the disaggregated resource.
  • Resource nodes have a fine grained mapping of a physical resource which is often a static map.
  • Figure 7 depicts an example of a split resource map of disaggregated storage array.
  • Split resource map 700 comprises resource sub-map 725 stored in first memory 720 and resource sub-map 735 stored in second memory 730.
  • Resource sub-map 725 translates LBAs 726A through 726N to resource node addresses 728A through 728N.
  • Resource sub-map 735 represents an example static resource map of a resource node addressed in resource sub-map 725.
  • Resource sub-map further translates LBAs 736A through 736M to physical addresses 738 A through 738M representing a physical location on a disk drive.
  • Resource sub-map 725 and resource sub-map 735 do not have to have the same number of elements because a resource node's resource map generally represents a fraction of the complete disaggregated resource.
  • a resource consumer interacting with the example storage array uses resource sub- map 725 to find an address of a resource node.
  • Resource consumer sends requests to a specific resource node, if required.
  • the specific resource node could then use a resource map similar to example resource sub-map 735 to interact with a disk.
  • resource maps and resource sub-maps employ software functions for translation.
  • the software functions are used to yield resource node responsibilities including striping, parity, spanning, mirroring, or composite responsibilities.
  • Each resource map in a disaggregated resource comprises a level of granularity.
  • Resource consumers often utilize coarse grain resource maps with a granularity at the resource node level.
  • the resource node level corresponds to a granularity at physical partition level that is at or below a physical resource level.
  • a resource node represents a logical disk partition that corresponds to a disk partition that is smaller than a disk drive.
  • a resource node represents a logical video frame that corresponds to a frame that is smaller than the video monitor.
  • Figure 8 depicts several resource maps with various levels of granularity.
  • Physical resource 840 comprises physical resource partition 843 and 845.
  • a resource node is responsible for providing access to physical resource 840, and more particularly to resource partition 843.
  • Resource map 810 has a granularity at the resource partition level because it translates resource element IDs to physical addresses that correspond to a physical resource partition level.
  • the physical address of a physical resource partition corresponds to an address of a resource node.
  • Physical resource partition 840 further comprises physical resource elements 844A through 844N.
  • Resource map 820 has a granularity at the resource element level that is below the physical resource partition level because it can address individual physical resource elements 844 A through 844N.
  • Contemplated resource elements include addressable blocks on a disk drive.
  • Resource nodes typically employ resource maps that have granularity at the physical resource element level. It is contemplated that resource map can have a granularity below the physical resource element level.
  • Physical resource elements 844A through 844N comprise smaller pieces as depicted by atom 845.
  • disk drives have addressable blocks where each block comprises a number of bytes and each byte comprises a number of bits.
  • Resource map 830 has a granularity at the atomic level. It is contemplated that resource nodes that employ resource maps having an atomic level granularity or below use multiple operations to access the physical resource. In the case of a storage array, a resource node would use a read- modify-write operation to change a byte within a block.
  • Figure 9 depicts a possible set of steps by which entities can interact with disaggregated resources.
  • a manager creates a disaggregated resource.
  • the manager responds to a need for the disaggregated resource and creates two or more resources nodes that have responsibility for physical resources in order to attempt to fulfill the need.
  • an IT manager establishes that an enterprise requires several terabytes of disk storage comprising mirrored data.
  • the manager creates a number of logical disk partitions (resource nodes) representing partitions on a set of disk drives where the collection of logical disk partitions substantially fulfills the capacity and mirroring requirements.
  • the manager uses a one or more software utilities designed to communicate with resource consumers, resource nodes, or modules that connect to physical resources.
  • a manager includes an administrator, a software package used to manage disaggregated resources, management servers, or management appliances. It is further contemplated that a manger includes a resource consumer, or a computer system external to the resource consumer and disaggregated resource.
  • the manager establishes an organizational structure for the resource nodes.
  • the organization structure provides definitions for the roles and responsibilities of each resource node. Once the roles and responsibilities are established, a manager creates a corresponding set of resource node information and resource maps for deployment into the resource nodes. Resource consumers use knowledge of the rules for forming the organizational structure in order to convert resource node information in a useable map of the disaggregated resource or determine if the disaggregated resource has coherency.
  • the organizational structure comprises a hierarchy based on group types.
  • the hierarchy groups include mirrors, stripes, composites, or spans. Other contemplated hierarchical structures include trees, or directed graphs. Non-hierarchical structures are also contemplated including flat sequential lists with no preferred ordering of resource nodes.
  • a manager assigns resource node information and resource maps to each of the resource nodes composing the newly created disaggregated resource.
  • Each resource node receives the resource node information and the resource map information then stores the information in a memory.
  • the resource node information provides resource consumers information regarding what role the resource node takes on in the disaggregated resource.
  • the resource map determines the responsibility of the resource node.
  • resource node information includes group types, resource name, resource node name, or resource node attributes.
  • a resource consumer wishes to access the disaggregated resource. If the resource consumer is not aware of the disaggregated resource, the resource consumer sends a discovery message to the disaggregated resource. Possible discovery methods include reading resource node from a local information store, sending broadcast message to resource nodes, using standardized protocols including SSDP, requesting resource node information from a server external to the disaggregated resource, or other methods that result in delivering resource node information to the resource consumer. In a preferred embodiment, discovery includes identifying the physical modules that comprise resource nodes and sending name resolution requests with possible regular expression information to all resource nodes. If a resource node receives the request and its name matches the regular expression, it will respond. Contemplated discovery requests include those sent via unicast to individual resource nodes, via multicast sent to groups of resource nodes, or via broadcasts sent to one or more groups of resource nodes.
  • each resource node receives discovery messages sent by resource consumers. Upon reception and verification of authorization, if any authorization is required, of the discovery message, each individual resource node retrieves its own resource node information from its memory. The resource node information is formed in to a response message which is sent back to the resource consumer at step 930. The resource consumer attempts to collect responses from all responding resource nodes. It is contemplated that not all resource nodes will respond. In a preferred embodiment, resource consumers have the ability to cycle through multiple discovery attempts to capture as much resource node information as possible given an unreliable network.
  • the resource consumer collects resource node information from responding nodes and constructs a resource map of the disaggregated resource.
  • the resource consumer has an understanding of the rules of the organizational structure used to create the resource node information. Based on the rules and on the resource node information the resource consumer can determine if the constructed resource map is sufficiently complete or if the disaggregated resource has coherency. If the resource map is sufficiently complete, then the disaggregated resource is usable by the resource consumer. A sufficiently complete resource map is still considered complete as long as some members of a parallel group are visible. For example, if a storage array comprises a number of mirrored groups, each being a member of a parallel group, then one mirror could disappear from view, but the storage array would still be useful.
  • the resource consumer assesses the constructed resource map and at step 942 it decides if the disaggregated resource has coherency. If the disaggregated resource has coherency, then the resource consumer can begin using the disaggregated resource at step 950. If the disaggregated resource does not have coherency, then the resource consumer determines if the resource map has sufficient group members to still be useable. In a preferred embodiment, the resource consumer checks resource node names, group type, or resource node attributes to check for coherency.
  • a preferred embodiment sorts the list of responding resource nodes, or removes duplicates, if any.
  • the resource consumer decides if the remaining group members are able to provide sufficient capabilities to service requests from the resource consumer. This is possible in the case where there are sufficient parallel groups, or redundant resources. If there are sufficient capabilities remaining, the resource consumer will modify the resource node attributes at step 945 to indicate a change in coherency to be detected by other resource consumers. In a preferred embodiment, a resource consumer modifies coherency attributes including a resource node valid attribute. If the disaggregated resource is not usable, the resource consumer optionally reports an error or optionally attempts the discovery process again at step 920 to pick up lost members.
  • a resource consumer could probe for specific lost members based on detecting gaps within the map structure and on sending regular expression discovery messages tailored to check for names the could appear in the gaps. After modifying resource node attributes at step 945, the resource consumer can begin using the disaggregated resource at step 950.
  • a resource consumer interacts with the disaggregated resource by sending messages to the disaggregated resource, hi a preferred embodiment, sending messages comprises sending packets over a packet switched network to all the resource nodes, to groups of resource nodes, or to individual resource nodes that compose the disaggregated resource.
  • sending messages comprises sending packets over a packet switched network to all the resource nodes, to groups of resource nodes, or to individual resource nodes that compose the disaggregated resource.
  • communications between the resource consumer and disaggregated resource take the form of UDP packets that are broadcast, multicast, or unicast.
  • a resource consumer employs optimization techniques when communicating with resource nodes over an unreliable communication path.
  • Contemplated optimizations techniques including employing a slow start or congestion avoidance similar to TCP to ensure the communication path is solid.
  • One factor that contributes to a communication path being unreliable is the physical nature of the infrastructure. For example, hubs, switches, or routers form a packet switched network that could represent an unreliable communication path. Not all networking equipment is the same nor does the equipment faithfully implement protocols or standards. Consequently, in an especially preferred embodiment where resource consumers use UDP to communicate with resource slow start or congestion avoidance ensures networking equipment is not overwhelmed due to speed of communications.
  • all resource nodes that are capable of receiving messages from a resource consumer accept the messages. Because resource consumers can broadcast a single message to a group of resource nodes, one or more resource nodes within a group can receive the message. Each individual resource node determines if the message requires their individual attention based on their role or responsibility as set forth by their resource node information or resource maps. For example, a message from a resource consumer can include a resource element ID which the resource node compares against its resource map. If a resource node determines that is should not take any action, the message is silently discarded. If the resource node determines it should take action on the message, it proceeds to analyze the message.
  • a resource node determines if the resource consumer message is a request for a resource for which the resource node is responsible. If the message is a non-resource specific message, then message is handled appropriately at step 985 and a response is generated and sent if necessary, and resource consumers processes the response at step 990.
  • Non-resource request messages include configuration messages, management messages, firmware updates, or other messages that are of interest to the resource node beyond a request for the resource. If the message is a resource request, the message is further analyzed.
  • a resource node bypasses a communication stack to deliver the message directly to the main functionality of the resource node in order to improve over all performance of the resource node.
  • a resource node comprises a TCP/IP protocol stack that supports UDP packets.
  • the resource node checks the packet and delivers the packet data directly the resource node application rather than requiring full processing of the TCP/IP stack eliminating superfluous computation.
  • resource node checks the data in the resource request message to ensure that the resource node is able to handle the request or to determine if the request is an access fault.
  • resource request messages include one or more referenced (directly or indirectly) resource element IDs.
  • the resource node checks its responsibilities to determine if it is able to handle the referenced resource element IDs. If some of the resource element IDs fall outside the scope of the resource node's responsibilities, then an access fault has occurred.
  • the resource node reports the access fault back to the resource consumer and the resource consumer processes the response at step 990.
  • a access fault shows a resource consumer that it is attempting to bridge resource nodes.
  • the first resource node will generate an access fault.
  • the resource consumer can then break the message up into individual requests each having a single LBA resulting in an aligned write to the resource nodes. If the message is not an access fault, the resource node continues its handling of the resource request.
  • the resource node fully processes the resource request.
  • the full processing depends on the type of resource for which the resource node is responsible. Contemplated processing includes, I/O processes, checking status, resource management, resource node information management, resource map management, or other processing as determined by the resource node's responsibilities or a protocol designed for use with the resource node.
  • one type of processing that a resource node conducts is the processing of a proxy command at step 966.
  • a proxy command instructs a resource node to communicate with another resource node as if the first resource node were a resource consumer. The first resource node then relays the message to the second resource node which is waiting at step 955.
  • the resource node continues processing the resource request and generates a response sent back to the resource consumer.
  • the resource consumer receives the response at step 990.
  • resource consumers and resource nodes communicate over an unreliable communication path that could result in loss of coherency of the disaggregated resource.
  • multiple independent resource consumers access disaggregated resources simultaneously, each resource consumer having its own view of the disaggregated resource which could be out of synch with the other resource consumers. Therefore, in a preferred embodiment a resource consumer will periodically check for disaggregated resource coherency.
  • the resource consumer checks to see if coherency requires checking. If so, the resource consumer returns to step 920 and initiates a discovery.
  • the periodicity of checking for coherency or conditions when coherency should be checked varies depending on the resource consumer and disaggregated resource environment. In a preferred embodiment, periodicities are greater than or equal to one second.
  • An example of a disaggregated resource includes a data storage array comprising a plurality of hard disk drives.
  • the storage array is virtualized by two or more resource nodes represented by logical disk partitions.
  • Each logical disk partition has responsibility for a partition on a disk and comprises an IP address used to communicate with resource consumers.
  • a logical disk partition also comprises a resource map that translates LBAs to physical disk locations used during I/O operations.
  • the collection of logical disk partitions appears as a locally connect, logical volume to computer systems that represent resource consumers.
  • each logical disk partition When the storage array is built, the logical disk partitions are created and given names that reflect their position and role in the group hierarchy of the logical volume.
  • the group hierarchy includes groupings based on mirrors, stripes, or spans.
  • the storage array represents itself as a single logical volume with a name.
  • Each logical disk partition incorporates the logical volume name and group type information within the name of the logical disk partition.
  • each logical disk partition also includes attributes indicating if there are more logical disk partitions in each type of group.
  • a resource consumer in the form of a software driver installs on a workstation beneath an operating system or file system.
  • the low level placement of the driver allows existing software and applications to utilize the storage array without requiring modification.
  • the driver encapsulates FO requests into a packets sent to the logical volume.
  • the requests are sent to the logical volume collectively through a multicast message where the logical partitions composing the logical volume are all members of the multicast group. It is contemplated that the driver communicates with the logical volume through unicast messages as well.
  • the driver sends a discovery request to the logical volume and the logical disk partitions respond with their individual information including their names or attributes.
  • the driver collects the resource node information, it builds the initial coarse grained map of the logical volume where the map comprises the roles each logical partition plays and the addresses of the logical partitions.
  • the driver begins communicating directly with logical volume. If the map is incomplete as determined based on the names and attributes, the driver attempts to find missing elements or possibly generates errors. If the logical volume comprises mirrors, the driver is able to determine that a group of partitions could have lost coherency with the rest of the data set by detecting changes in the coherency attributes including the resource node valid attribute.
  • the driver sends read and write requests to the logical volume or to individual logical partitions.
  • each logical disk partition checks the LBAs of the requests and compare the LBAs to the logical disk partition's map of the disk partition to determine if action is required. If no action is required, the logical disk partition silently discards the packet; otherwise, the packet is processed.
  • logical disk partitions employ several optimizations.
  • Packets from the driver can bypass the logical disk partition's communication stack to improve performance.
  • packets are checked for access faults and the faults are reported back to the driver.
  • the driver interprets the access fault as instructions to break requests up so individual logical disk partitions process the requests naturally without requiring logical disk partitions to communicate with each. Other interpretations of access faults are also contemplated in order to properly handle or report error conditions.
  • the data storage array uses modules attached to disk where the modules provide the logical disk partition functionality.
  • Modules include rack mount enclosures that house 12 or 16 disks, desktop enclosures housing one to eight disks, single adapters that plug onto disks, or embedded modules integrated with other products.
  • a disaggregated resource includes a video display comprising a plurality of monitors.
  • the video display is virtualized by two or more resource nodes represented by logical video frames where each logical video frame has an address and an extent of pixels, mostly likely in an (x,y) coordinate system.
  • a logical video frame comprises a map that translates a virtual pixel address into a physical pixel address on one of the monitors.
  • the collection of logical video frames appears as a locally connected video display from the perspective of an electrical device.
  • Contemplated electrical devices include computers, TVs, video games, ATM machines, or other devices requiring a display.
  • the groups employed by the video display including composites groups where multiple logical video frames combine to form an aggregate display or mirror groups where the same video information is displayed more than once. It is contemplated a mirror includes a video recorder.
  • the logical video frames also have names associated with their group types and the name of the video display. Coherency checks for the video display are useful to determine if a monitor has failed.
  • a resource consumer for the video display can take the form of a driver.
  • a module comprising hardware, software, or firmware plugs into a video out of an electrical device and communicates with the video display transparently to the electrical device.
  • image decoding can occur within the resource consumer, a third system, or within the logical video frame. Image decoding is required to determine which pixels of the image display should be mapped to which logical video frame and to provide for proper scaling for pixels.
  • Alternative embodiments include other computing related resources including processors where the processor bandwidth is treated as a resource, memory where memory is segmented into partitions similar to a storage array, networking interfaces where QoS or bandwidth is treated as a resource, power supplies, audio I/O, input devices, or others.
  • Alternative embodiments also include non-computing resources including products supplied through a distribution system where a number of warehouses represent resource nodes.
  • disaggregated resource can combine to form a larger heterogeneous resource.
  • disaggregated resources including a storage array, video display, or processing array combine to form a disaggregated computer.
  • the inventive subject matter includes hardware or firmware of such adapters as well as licensing, selling, advertising, managing, distributing, or operating the adapters.
  • inventive subject matter includes methods of writing such software, recording the software on a machine readable form, licensing, selling, distributing, installing, or operating such software on suitable hardware.
  • the software per se is deemed to fall within the scope of the inventive subject matter.
  • Disaggregated resources provide a number of advantages. Individual resource nodes composing the disaggregated resource function independently of all other resource nodes which increases responsiveness or performance from the perspective of a resource consumer without requiring out-of-band communication. Each resource node provides partial information regarding the over map of the disaggregated resource allowing each resource consumer that requires access to the disaggregated resource to build its own effective map. This allows two resource consumers to share the same physical resources, or even the same logical resource, but access the disaggregated resources differently as required by their individual needs. For example, a first resource consumer could access one mirror group preferentially over another mirror group to reduce conflicts with the second resource consumer.
  • disaggregated resources place resource map information with required granularity where it is necessary to ensure efficient access.
  • the disaggregated resource can comprise resource maps with very fine levels of granularity offering a high degree of control over physical resources.

Abstract

Disaggregated resources distributed among resource nodes provide access to resource consumers by offering resource node information to the resource consumers. Resource node information supplied by each individual resource node comprises incomplete information with respect to the complete disaggregated resource. Resource consumers collect resource node information to create maps of the disaggregated resource, ensure coherency, or manage the disaggregated resource.

Description

DISAGGREGATED RESOURCES AND ACCESS METHODS
Field of The Invention
The field of invention is disaggregated resources especially storage devices, memories, displays, CPUs, or other computing resources.
Background of The Invention
As computers have increased in capabilities the demand for computing resources has also grown. In fact, the demand for computing resources has grown to the point that, in many cases, far outstrips the capabilities of a single computer to deliver resources to a user or application. For example, enterprises have been aware of this issue for many years and continue to buy racks of servers, storage arrays, or other computing resources at great cost to satisfy their needs for computing resources, hi addition, many consumers employ external hard disk drives to store massive amounts of personal media data because their personal computers lack sufficient capacity. There are many reasons that drive the demand and proliferation of computing resources including legislation that affect enterprises, applications, ubiquitous digital cameras, media players, and countless other reasons. Industry has responded to the demand by producing products that provide computer resources to users and applications where the computing resource products are no longer centralized to a single computer. Furthermore, the industiy is tending toward a distribution of computing resources where individual computer systems offer their capabilities or services to other users and applications where on example includes NAS file servers.
Yet another trend in the industry is to disaggregate resources into their constituent resource device elements, for example USB disk drives or SAN systems. However, these approaches do not address combining a number of resource elements together into a coherent virtual whole from the perspective of an arbitrary set of resource consumers, but rather these approaches still follow a centralized approach.
Even though the industry combines computing resources into a centralized set of capabilities or services as in SAN systems, it is still desirable to have the computing resources exist as individual resource nodes. Centralized resources imply further costs to due to the expense of the additional computer hardware and infrastructure; introduce yet another point of potential failure beyond the computing resources; create a bottleneck that all resource consumers must pass through, and so on. Disaggregated resources that comprise a collection of individual resource nodes that work independently but operate as a whole without a centralized controller or manager alleviate many of these problems. Individual resource nodes carry a smaller per unit price tag reducing incremental cost and offering stronger scalability, introduce no additional points of failure, do not require out-of-band communication increasing responsiveness, and operate in parallel increasing performance. Some known examples of distributed resources include clusters or applications like SETI@Home that offer CPU bandwidth as the computer resource. These examples are "distributed resources" where they rely on complete computer systems rather than individual computing resources and operate at an application level rather than at a resource device level.
Offering true disaggregated resources that comprise a number of resource nodes has a number of critical problems that must be overcome in order to deliver a solid disaggregated solution. First, networks are inherently unreliable and have latency; consequently resource nodes might loose connectivity with a resource consumer. Connectivity is important where the state or coherency of the resource from the perspective of a resource consumer is important. Second, multiple resource consumers can have different "views" of a disaggregated resource from each other; therefore, each resource consumer can have a different resource map used to access the disaggregated resource. Furthermore, most computer systems require access to a resource at a very fine level of granularity well below the resource device level. For example, when a CPU addresses memory, it attempts to reference a single byte or word. Such fine level granularity access is difficult in a disaggregated resource because a resource consumer does not necessarily have a sufficiently detailed and complete map of the disaggregated resource. Third, the organization of the resource nodes can be quite complex depending on a number of factors including type of resource involved, the roles or responsibilities of each resource node, resource node location, or other implementation specific information. Fourth, operating systems have to interpret the disaggregated resource as if it were locally connected in order to provide applications seamless, transparent integration with the computing environment.
A number of attempts have been made previously to provide a solid solution addressing the problems encountered when building a disaggregated resource. BitTorrent™, a peer-to-peer file transfer system, offers redundant file storage as a distributed resource where each resource node is complete computer system offering file storage. BitTorrent™ places redundant copies of data on multiple peers to alleviate some of the risk of an unreliable network; however, the peers are outside the control of the ultimate user so the user does not know if a peer is lost. Any owner of a BitTorreiit™ system can take down their computer reducing the overall performance for a remote unknown user. Although each BitTorrent™ user has a different view of the network based on connectivity, a user can not, in a deterministic fashion, determine the extent and usability of the system. Unfortunately, BitTorrent™ is an application level protocol and does not provide a transparent solution of a storage resource that allows an operating system or application to read and write data at will. Cleary, BitTorrent™ offers some utility for high level file transfers; it is not suitable for consumer or enterprise system due to the lack of control, determinism, fine level access, or performance. BitTorrent™ and other peer-to-peer systems have not addressed the need for resource consumers to understand a complete map for a coherent disaggregated resource at an elemental level.
Hitachi's U.S. Patent No. 4,890,227 offers a resource management system for operating systems of large scale computers. The memory, CPUs, I/O channels, and storage devices represent disaggregated resources. The management system relies on a set of policies that are continually updated and deleted to equitably and autonomously assign resources to process units (programs, threads, or tasks). The Hitachi patent does not address issues of resource coherency over an unreliable communication path or issues of multiple resource consumers (process units) having different views of the resources because a centralized management system handles all the resources for the process units. The Hitachi patent does not provide insight on how a resource consumer will manage and access resource nodes of a disaggregated resource that extend beyond the core computer. Even though the Hitachi patent addresses more elemental resources, the centralized resource management system does not allow resource consumers build their own view of the system or to function independently. Furthermore, the Hitachi patent offers no insight how to address fine level of structure of a resource.
Microsoft's U.S. Patent No. 6,912,622 attempts to resolve some of problems associated with a distributed resource that are similar to the problems associated with a disaggregated resource where the distributed resource is a peer-to-peer system. The
Microsoft patent uses an underlying statistical assumption regarding the probability of a first peer knowing a second peer's ID based on the "distance" between the first peer's ID and the second peer's ID. Through this structure, the Microsoft patent offers an efficient peer-to-peer name resolution system which allows a peer to keep track of and to find other peers by a useable organization scheme resulting in a map of the system. However, the structure only offers a way to access peers across a peer-to-peer network, but does not offer developers a way to access a fine level of detail within a peer as required by a truly disaggregated resource. Furthermore, the Microsoft patent does not address the need for a resource consumer to know the coherency of the disaggregated resource as resource nodes lose connectivity. In a peer-to-peer network, if a peer drops out, other peers don't necessary care. However, if the a resource node representing a CPU, memory, or storage device drops out of connectivity, all resource consumers using the disaggregated resource will need to know. Finally, the Microsoft patent does not address more elemental resource nodes.
Adaptec's U.S Patent No. 6,922,688 offers a method of accessing data objects where portions of the object are found through obtaining referential maps comprising logical storage locations and physical maps comprising the physical storage locations associated with the logical storage locations. Although the patent teaches how to access data objects distributed across a plurality of physical locations, it does not enable disaggregated resources, how to access such resources, how to operate disaggregated resources as one functional whole resource, or how to maintain a disaggregated resource over an unreliable network. In addition, although one aspect of the present invention comprises a split map, that map is a split map of disaggregated resources not a split map of data objects. Moreover, as discussed below, the term resource, as used herein, excludes data objects.
None of the previously presented examples fully address the problems encountered for building and accessing disaggregated resources. A more complete solution handles unreliable communications, resource consumer and resource node independence, resource coherency, fine level access to the resource, and applies to many types of elemental resources rather than create a solution for a single type of resource. Such a solution would have the following characteristics:
• A disaggregated resource would comprise independent, efficient resource nodes that do not necessarily communicate with each other and do not require out-of-band communications • The resource nodes provide information about their role in the disaggregated resource to resource consumers that request the information
• Resource consumers discover and access the resources without accessing extraneous systems
• Resource consumers construct their own view of the disaggregated resource based on information from the resource nodes
• A map of the resource provides access to a fine level of granularity to the resource at or below the resource device level
Thus, there remains a considerable need apparatus for disaggregated resources and for methods of accessing disaggregated resources.
Summary of the Invention
The present invention is directed toward disaggregated resources that are spread among a number of resource nodes. The resource nodes provide information about the nodes to resource consumers who then create maps of the resource in order to properly access the resource. The resource node information from a single resource node comprises insufficient information for a useable map of the disaggregated resource. Resource node information from multiple resource nodes allows for the creation of a more complete, useable map from perspective of a resource consumer. Another aspect of the invention provides for resource maps to be split among elements of the disaggregated resource and resource consumers in order to provide fine level access to the resource without the resource consumer storing an entire map of the resource.
Another aspect of the invention is directed toward methods for accessing the disaggregated resource through organizing resource nodes into the disaggregated resource, assigning resource node information, and assessing if the disaggregated resource has coherency. The resource nodes respond to discovery messages by providing their resource node information from which the resource consumers construct a map on how to access the resource. Resource consumers that wish to access the resource include resource consumers, resource managers, or other interested parities. In a preferred embodiment resource node information includes resource name, resource node name, resource node type, or resource attributes. Resource consumers that interact with disaggregated resources use the resource name to identify a particular resource. Furthermore, each resource node that composes the disaggregated resource includes a resource node name to differentiate themselves in cases where the nodes offer redundant physical resources. Through the use of names, the resource consumers select which resources are of interest and which resource nodes compose the disaggregated resource.
A resource node type provides resource consumers with the necessary information on how to access a specific node within the disaggregated resource. Resource nodes function independently of each and focus only on their prescribed roles and responsibilities.
Consequently, resource node attributes provide the resource consumers information on node state with respect to the disaggregated resource if resource state is critical relative to other nodes and provide resource node role information. Contemplated attributes include coherency attributes that indicate when a disaggregated resource has lost coherency. If a disaggregated resource loses coherency, then one or more resource nodes have lost contact with active resource consumers; consequently, the disaggregated resource might not operate correctly or might not offer correct services. Resource node information from an individual resource node comprises incomplete information for resource consumers to interact with the complete disaggregated resource.
In a preferred embodiment a resource consumer collects resource node information from resource nodes to create a map of the disaggregated resource. The resource map can be the same from one resource consumer to another or can be different from one resource consumer to another depending on what the resource consumers requires and how the disaggregated resource is configured. Furthermore, the resulting resource map has sufficient detail to allow a resource consumer to access the disaggregated resource. The resource map provides a translation of a resource element ID to a physical resource address, location, or capability. It is specifically contemplated that the resource map splits into a number of resource sub-maps where each sub-map provides a further translation. For example, a resource consumer accessing a disaggregated resource creates a top level resource sub-map that comprises a translation of a resource element ID to an address of a resource node. Each resource node also comprises a resource sub-map that translates a resource element ID to a physical resource ID. Consequently, it is contemplated that a resource map has a granularity below a resource unit level to a partition that represents some fraction of a physical resource. In addition, it is specially contemplated that a resource map has granularity below a partition level to an individual addressable physical resource element level, or even smaller.
Contemplated disaggregated resources include, but not limited to, computer related resources including storage, displays, CPU, input devices, or memory. Non-computer related resources are also contemplated including electrical power, or networking bandwidth. Specifically contemplated examples include disaggregated storage or disaggregated video displays. Furthermore, a disaggregated resource could comprise a heterogeneous mix of physical resources. For example, a number of disk drives, displays, CPUs, or memory where each individual item is disaggregated, then forms together into a complete disaggregated computer system.
Glossary
The following descriptions refer to terms used within this document. The terms are provided to ensure clarity when discussing the various aspects of the invention matter without implied limitations.
"Access Fault" means an error condition raised by a resource node when a resource consumer has requested access to a resource beyond the capabilities of the resource node. When the consumer receives the access fault, the consumer reconsiders the request and then forms a more reasonable request, or takes other desired actions.
"Resource" means a fungible commodity required or desired by a resource consumer for the consumer's operation. Resources include displays, I/O channels, data storage, network bandwidth, processor bandwidth, or other items used by a resource consumer even those that pertain to other systems beyond computing, for example electrical power. "Physical Resource" means a physical instantiation of a resource at the device level. Examples of physical resources include video displays, hard disk drives, memory, processing units, media cards, network interfaces, power outlets, or other items used by resource consumers. Resources have granularity; therefore, "resource partition" means a logical representation of a portion of a physical resource. For example, a resource partition of a hard disk drive includes a logical partition on the disk that is addressable external to the disk. Another example of a resource partition includes a range of pixels in an (x,y) coordinate system on a video display representing a logical frame that is addressable external to the display. Further granularity is possible; therefore, "resource element" means a natural, logical unit of the resource that can be addressed. An example of a resource element for a storage system based on a hard disk drive includes a data block on the disk. Yet another example of a resource element includes a pixel of a video display.
On the other hand, the term "resource," as used herein does not include a file or other data object because data objects are not fungible.
"Resource Consumer" means an entity that utilizes a commodity in order to deliver its desired functionality. Resource consumers have the ability to operate independently of other resource consumers. Further, resource consumers share physical resources though each resource consumer could have a completely different perspective of the resource. Example resource consumers include applications, computers, operating systems, file systems, users, management software, or other entities that have interest in the commodity.
"Resource Map" means a translation from a virtual addressing scheme to a physical addressing scheme of a physical resource. Resource maps comprise software or data that translates a resource element ID into a physical address. Resource maps can split into a number of resource sub-maps where each sub-map comprises a further translation to a final physical address. Resource maps offer varying degrees of granularity of mapping virtual resource elements to physical locations. For example, a network storage system can offer a resource map with a granularity at the disk level, at the partition level, at the data block level, or even below the data block level by employing extra operations. "Split Resource Map" means a resource map split into a plurality of resources sub-maps. The resource sub-maps reside in individual memories where necessary.
"Resource Node" means a logical construct executing on a processor comprising software or firmware that provides a representation of a resource partition. Resource nodes comprise resource node information including and resource maps. Resource nodes have the ability to function independently of other resource nodes. This implies resource nodes do not require out-of-band management to communicate with other resource nodes in order to deliver a coherent disaggregated resource. An example of a resource node is a combination of hardware, software, or firmware that functions as a logical partition of a disk. The logical disk partition accepts read or write requests based on a logical block address (LBA) of a data block, and then performs the requested operation on the disk. Yet another example includes a combination of hardware, software, or firmware that functions as a logical video frame. The logical video frame similarly accepts pixel display commands then instructs a monitor to modify the pixel within the bounds of the logical frame. Resource nodes comprise a resource map to aid in the translation of a resource element ID, including an LBA or logical (x,y) coordinate of a pixel, to a physical address, a disk sector, track, cylinder, or (x,y) coordinate on a monitor.
"Resource Node Attribute" means an attribute within a resource node used by resource consumers external to the resource node to determine if the disaggregated resource suffers from a potential problem or to determine the resource node's role. Resource node attributes include coherency attributes. If a coherency attribute or collection of coherency attributes indicates a disaggregated resource has lost coherency, a resource consumer can then determine the best alternative steps to deal with a potential problem.
The teachings herein may be advantageously employed by developers of disaggregated systems to develop, manage, deploy, or utilize the systems with reduce costs and overhead. Disaggregate resources access methods can be used to manage large arrays of storage devices, arrays of video displays, or other resources that lend themselves to disaggregation.
Various objects, features, aspects, and advantages of the present invention will become more apparent from the following detailed description of the preferred embodiments of the invention, along with the accompanying drawings in which like numerals represent like components.
Brief Description of The Drawings
Figure 1 is a schematic of a disaggregated resource.
Figure 2 is a schematic of a possible physical embodiment for a resource node.
Figure 3 A is a schematic of an example of a parallel group of resource nodes composing a disaggregated resource.
Figure 3B is a schematic of an example of a serial group of resource nodes composing a disaggregated resource. Figure 3C is a schematic of an example of a combination group of serial and parallel groups of resource nodes composing a disaggregated resource.
Figure 4A is a schematic of an example of group types for a disaggregated storage array.
Figure 4B is a schematic of an example of group types for a disaggregated video display.
Figure 5 A is a schematic for an example of a physical arrangement of logical disk partitions composing a disaggregated storage array representing a logical volume Z.
Figure 5B is a schematic for an example of the logical organization of logical disk partitions composing a disaggregate storage array representing the logical volume Z of Figure 5A.
Figure 6 is a schematic for a resource map.
Figure 7 is a schematic for a split resource map.
Figure 8 illustrates resource maps with varying degrees of granularity.
Figure 9 is a schematic for an example method of accessing a disaggregated resource.
Detailed Description
In order to provide clarity for the detailed description of the inventive subject matter, two example embodiments are provided. Neither implementation should be interpreted as a limitation of the subject matter in any way. The first example embodiment focuses on a disaggregated storage array comprising resource nodes in the form of logical disk partitions and the second embodiment focuses a disaggregate video display comprising resource nodes in the form of logical video display frames.
Disaggregated Resources
Figure 1 presents a schematic for a disaggregated resource comprising more than one resource node. Disaggregated resource 100 comprises resource nodes HOA through 11OE. Although five resource nodes are shown, the number of resource nodes varies as necessary to fulfill the requirements of the disaggregated resource. Resource consumers 140A through 140N communicate with disaggregated resource 100 through communication path 150 by communicating with resource nodes HOA through 11OE.
In a preferred embodiment, resource consumers 140A through 140N take the fomi of an electrical device running a driver on a processing unit. The driver allows an application or applications running on the electrical device to interact with the resource provided by disaggregated resource 100 and resource nodes HOA through 11 OE in a manner that is transparent to the applications. Consequently, disaggregated resource 100 appears as a local resource from the perspective of the applications. Examples of resource consumers include computers, operating systems, file systems, management software, or other entities that require access to disaggregated resource 100.
Resource consumers 140A through 140N interact with disaggregated resource 100 independently from each other. Therefore, each resource consumer has its own view of disaggregated resource 100. Resource consumers 140A through 140N can alter resource node information stored on resource nodes 11 OA through 11 OE. It is contemplated that altering resource node information provides for indicating changes in the disaggregated resource.
Communication path 150 provides for a sustained communication between resource consumers 140A through 140N and resource nodes HOA through 11 OE. It is specifically contemplated that communication path 150 is unreliable and could result in one or more of the resource nodes 11OA through 11OE losing connectivity with resource consumers 140A through 140N. Contemplated communication paths include packet switched networks, computer busses, wireless transmission, or other communication mechanisms.
Resource Nodes
Figure 2 presents a schematic for a possible embodiment of a resource node.
Resource node 200 comprises processing unit 210 and memory 220. Processing unit 210 communicates with other entities, including resource consumers, over communication path 250. In addition processing unit 220 communicates with physical resources 260A through 260M over physical resource communication path 215. Processing unit 210 communicates with memory 220 over bus 225. Memory 220 further comprises resource node information 230 and resource map 240.
Memory 220 comprises resource node instructions that run on processing unit 210, and data for use by processing unit 210 or resource consumers. Data structures include resource node information 230, or resource map 240. Resource node information 230 includes one or more information elements as indicated by resource infoπnation elements 233A through 233N and conveys information regarding resource node 200 to resource consumers over communication path 250. Contemplated examples of resource node information include resource name, resource node type, resource node name, resource node attribute, address, or other resource node information of interest to a resource consumer. Resource consumers use resource node information to determine the role of a specific resource node within a disaggregated resource, amount other tilings. Resource map 240 provides processing unit information on how to translate requests for resource access from resource consumers into an access of physical resource 260A through 260M. It is contemplated that resource map 240 provides information on translating resource consumer requests across one or more physical resources, at a partition level of a physical resource, or at levels smaller than a partition. Resource node 200 uses resource map 240 to aid in determining the responsibility of resource node 200 with regards to physical resources 260A through 260M.
Contemplated embodiments of resource node 200 include modules comprising a combination of hardware, software, or firmware. Modules in the form of adapters plug into physical resource devices to provide resource node capabilities. Furthermore, adapters comprising embedded boards, chips, ASICs, FPGAs, or other components integrate into a larger physical resource device to provide resource node capabilities, hi addition, modules include enclosures that house one or more physical resource devices. Such modules include operating systems and communication stacks as firmware infrastructure to support resource node functionality. Because modules can comprise software, it is also contemplated software programs or drivers install on computers or other computer-like systems to provide resource node capabilities for the individual resources within the computer. Contemplated software or firmware executing on processing unit 210 that provides resource node 200 with its functionality including monolithic code that supports multiple resource nodes, collections of functions operating together, or individual tasks or threads. Resource node 200 operates as a virtual structure that presents a logical representation of a portion of physical resource 260A through 260M to resource consumers. Contemplated logical representations include logical partitions of disk drives or logical video frames of a monitor. The logical representations combine to form the complete disaggregated resource, which itself is a logical representation of a resource.
If resource node 200 exists in a module that supports multiple other resource nodes, it is contemplated all resource nodes have unique addresses. In a preferred embodiment, each resource node acquires an address from external to the enclosure, from a DHCP server for example. In especially preferred embodiment, multiple resource nodes share a common physical address, an Ethernet MAC address, when obtaining an address by using unique virtual physical addresses when communicating with the address server. For example, an enclosure can obtain multiple IP addresses from a DHCP server by using virtual physical addresses for each resource node in the "chaddr" field, the client address field, of the DHCP client request even though the module has a single real MAC address. This approach alleviates issues where a DHCP server does not correctly implement the DHCP client ID properly.
Disaggregated Resource Node Groups
A resource node comprises the ability to function independently of all other resource nodes and offers resource consumers access to the physical resource for which the resource node is responsible without regard to all other resource nodes composing the disaggregated resource. Consequently, resource nodes combine together forming different structures based on how a group as a whole is intended to function. In a preferred embodiment, there are four types of group structures, a group of one operating as a complete resource, a parallel group structure where resource nodes provide redundant resources, a serial group structure where resource nodes provide no redundancy, or a combination of parallel and serial groups. A single resource node that operates as a group of one providing a single complete resource is a trivial case and does not represent a disaggregated resource.
Figure 3 A illustrates an example of a parallel group of four resource nodes. Disaggregated resource 300 comprises resource nodes 305 A through 305D and represents a single resource from the perspective of resource consumer 310. Resource consumer 310 interacts with disaggregated resource 300 through communication path 315. Resource nodes 305A through 305D offer the same fundamental resource, resource "A," to resource consumer 310. Therefore, if one of resource nodes 305 A through 305D disappears from view of resource consumer 310, resource consumer 310 still is able to fully utilize disaggregated resource 300 even though disaggregated resource 300 has lost a resource node because the remaining nodes offer redundancy. An example of a parallel group includes a data storage array employing a number of mirrored disks. If a disk fails, the other mirrored disks provide data due to redundancy. An additional example of a parallel group includes a video display comprising multiple monitors displaying the same video data. Should one monitor fail, the remaining monitors continue to provide the video data. In a preferred embodiment, a parallel group comprises two or more logical disk partitions that mirror data from another group of one or more logical disk partitions. Although Figure 3 A presents an example of four resource nodes, it is contemplated that a parallel group can comprise one or more resource nodes. It is further contemplated that a parallel group can comprise one or more other groups. Other contemplated parallel groups include archives or backups where data is stored for long term storage.
Figure 3B illustrates an example of a serial group comprising four resource nodes. Disaggregated resource 320 comprises resource nodes 325A through 325D. Disaggregated resource 320 appears as a single resource to resource consumer 310. Resource consumer 310 communicates with disaggregated resource 320 over communication path 315. Resource node 325 A through 325D each offer a single piece of a resource, "A" through "D" respectively, where resource consumer 310 requires resource "ABCD" to operate properly. Resource node 325 A through 325D offers no redundancy; therefore, if one of the nodes disappears from view of resource consumer 310, disaggregate resource 320 would no longer be usable. Examples of serial groups for storage arrays include two or more disks comprising RAID-O stripes or two or more disks where the disks are spanned together to offer larger capacity. If one disk fails, the storage array is rendered useless until the array is fixed. Other contemplated serial groups including a parity group, or an individual group where a parity group represents a group that comprises parity information and where an individual represents a group of a single resource node. An alternative example of a serial group includes a composite video display comprising two or more monitors that are combined into a single larger display. If one monitor fails, then the complete display is not useful because some video information is missing. Figure 3C illustrates an example of a combination group comprising a parallel group of two redundant serial groups. Disaggregated resource 350 comprises serial group 330 and serial group 340, each offering the same resource "ABCD" to resource consumer 310. Resource consumer 310 communicates with disaggregated resource 350 over communication path 315. Resource nodes 335A through 335D provide access to resource "A" through "D" respectively as do resource nodes 345A through 345D. An example of a combination group includes a RAID 10 storage array where data is striped across multiple disks and those disks are mirrored. It a preferred embodiment, combination groups comprise two or more logical disk partitions. Furthermore, in a preferred embodiment combination groups comprise one or more other groups.
One or more resource nodes combine to form a group. One or more groups form an additional group. Finally, a disaggregated resource is a single top level group comprising one or more other groups. Groups become complex as resource nodes combine to offer various capabilities and services and because each resource node operates independently other nodes. Resource consumers require information to understand how resource nodes combine to form a disaggregated resource in order to properly interact with the disaggregated resource and to determine if the disaggregated resource has coherency.
In a preferred embodiment groups are classified by types where a type specifically references a role played by a collection of resource nodes. Furthermore, it is contemplated that resource consumers access a group collectively in addition to accessing individual resource nodes within the group, hi the preferred embodiment, resource consumers access the group collectively through multicast messages.
Resource Node Information
Resource consumers obtain a view of a disaggregate resource based on information about resource nodes. Resource nodes are unaware of their role in a larger disaggregated resource because they focus on their main responsibility of providing access to a physical resource. Consequently each individual resource node lacks information about the complete disaggregated resource. Resource consumers obtain resource node information from each of the individual resource nodes and combine the information to form a description of the disaggregated resource in order to properly interact with it. In a preferred embodiment, resource node information resides within in a memory associated with the resource node; however, it is contemplated the resource node information can reside in alternative memories.
If a disaggregated resource comprises only a serial group, then a resource consumer only needs sufficient information to deteπnine if the serial group has coherency, hi other words, the resource consumer only needs to know if members of the group are missing. If the serial group does not have coherency (members of the serial group are missing), the disaggregated resource is not useable. Therefore, a resource consumer does not need complete information, i.e. information about which members are missing, but rather sufficient information indicating that members are missing. Furthermore, if a disaggregated resource comprises parallel groups, the resource consumer only requires sufficient information to determine if, through the redundancy of the parallel groups, the disaggregated resource is usable. Again, the resource consumer only needs sufficient information rather the complete information regarding the disaggregated resource.
Resource node information comprises elements that include the group type of a group of which a resource node is a member, resource node attributes, resource name, resource node name, resource node address, or other information desired by resource consumers to determine the coherency of a disaggregated resource. Resource consumers also use resource node information to determine the role of each resource node. If a resource consumer knows the role of a resource node, then the resource consumers is able to partially assist the resource node by performing operation in anticipation of passing resource requests to a resource node. For example, if a resource node represents a RAED-O stripe, a resource consumer could perform necessary modulo arithmetic based on an LBA to determine if the stripe should receive a specific data block.
Group Types
Resource nodes combine together to form groups of different types. Each type of group represents a desired functionality, capability, or service. A group type provides resource consumers partial information regarding the role a group plays in the disaggregated resource.
Figure 4A presents a set of possible group types for resource nodes composing a disaggregated storage array. The resource nodes are logical disk partitions in this example. Assume the storage array comprises a data set of sequential data blocks "ABCDEFGHI" that could represent a single file. The resource map of each logical disk partition determines for which data blocks the logical partition is responsible. Logical disk partitions 410A and 410B are mirrors because they both are responsible for identical data and provide redundancy; therefore, each logical disk partition has a "mirror" group type. One should note that each logical disk partition is a group of one node in the example shown. Mirror groups are a form of parallel group and have resource that is redundant to another group. Logical disk partitions 420A through 420C represent a stripe group where individual data blocks are written across logical disk partitions 420A through 420C sequentially, then wrap around. Logical disk partitions 430A through 430B represent a span group where data fills one logical disk partition then flows over to the next logical disk partition. Both a stripe group and a span group are serial groups, hi a preferred embodiment, contemplated group types include mirrors, stripes, or spans. In especially preferred embodiments, contemplated mirror subtypes include snap-shots, backups, or deferred mirrors; contemplated stripe sub-types include parity. Snap-shots are a type of mirror group that captures a dataset at a specific point in time. Backups are a type of mirror group that archives a data set. Deferred mirrors are a type of mirror that has an I/O latency with respect to a primary group. Parity is a type of stripe comprising parity information. Parity groups are useful when implementing RAID-3, 4, 5, 6, or other storage arrays that require parity.
Figure 4B presents another example of a set of possible group types for resource nodes composing a disaggregated video display. The resource nodes are logical video frames in this example. Logical video frames 440A and 440B represent mirror groups where the same data is displayed on the two logical frames. Logical video frames 450A through 450F compose a composite display and are therefore a composite group. One should note the logical video frames 440A through 440F do not have to be located on the same monitor or have to be located physically close to each other.
The purpose of the previous examples is to illustrate that group types are created based on the type of disaggregated resource. In addition, there are common group types regardless of the type of disaggregated resource, mirrors for example. Composite types are essentially a combination of span and stripe. Groups can become quite complex depending on how the different group types combine to from a disaggregated resource. In a preferred embodiment groups are organized according to rules for manageability providing resource consumers structure so they can determine how groups should fit together and what role each group takes on. In a preferred embodiment, group types are hierarchically organized. For example in storage arrays it is contemplated that mirror groups are the top most group because each mirror represents one data set. The next level comprises stripe groups that provide performance enhancement, and then at the lowest level are span groups that provide capacity enhancement. It is contemplated that group type information is encoded in a memory. In a preferred embodiment, the group type information combines with a resource node name. In an especially preferred embodiment, group types also represent results of operations performed on other groups. For example in a disaggregated storage array, if a stripe group of logical disk partitions has a "mirror operation" performed on it, the result is a mirror group comprising two stripe groups, the original stripe group and a new stripe group.
Resource Node Names
Resource node names provide one or more pieces of information to remote consumers including identification or group membership information, logical position information within a disaggregated resource, partial role information, or other information that a resource consumer might need to access a disaggregated resource.
Identification and membership information allows a resource node to uniquely identify itself with respect to other resource nodes whether they are in the same disaggregated resource or from other disaggregated resources. Resource consumers can resolve the resource node name to an address which allows the resource consumer to communicate directly with a specific resource node. Resource node names and addresses ensure each resource node is distinguishable from other resource nodes. In a preferred embodiment, a resource node comprises the name of the disaggregated resource. In an especially preferred embodiment, a resource node name comprises a unique identifier including a UUID or a GUID. In addition, a resource node name of a preferred embodiment resolves to an internetworking address including a unicast address or a multicast address.
In a preferred embodiment, a resource node name comprises position information that informs a resource consumer where in a disaggregated resource a resource node fits. In an especially preferred embodiment the position information comprise a group type. Therefore, a resource node name carries partial resource node role information.
Figure 5 A provides an example of a physical structure of a possible disaggregated storage array comprising a number of resource nodes where the resource nodes are logical disk partitions. The example is provided to clarify how resource consumers can utilize resource node names. A resource consumer operates on data set comprising data blocks "ABCDEFGHI" stored on disks 501 through 505. The storage array comprises three complete, redundant data sets for reliability. Each disk comprises one or more partitions controlled by a resource node that represents each partition as an addressable, logical disk partition. Resource node 510 is a mirror partition comprising the complete data and is; therefore, a mirror group of one resource node. Mirror group 520 also comprises the complete data stored on three striped resource nodes 522, 524, and 526. Mirror group 530 also comprises the complete data set stored on two striped resource nodes 532 and 534 and span group 535 that functions as a third stripe. Span group 535 comprises resource nodes 537 and 539. From the perspective of a resource consumer, this collection of resource nodes appears as a single logical volume that stores data set "ABCDEFGHI." The resource consumer attempts to make sense of the structure through each resource node's name. Figure 5B presents a logical representation of the same disaggregated resource of Figure 5 A assuming groups are organized hierarchically as follows from top to bottom: mirrors, stripes, and spans. Logical volume 500 comprises the groups to foπn a single disaggregate resource.
Given the example hierarchical structure illustrated in Figure 5B, Table 1 lists a possible name associated with the eight nodes composing logical volume 500. Table 1 assumes logical volume 500 has name "Z" and that mirror groups are designated by the letter "M" and a number, hi addition, stripe groups are designated by the letter "R" and a number. "R" is used in the example to reflect stripes are a RAID 0 group. Span groups are designated by the letter "S."
Figure imgf000021_0001
Table 1
Resource node names provide a resource consumer enough information to understand the structure of the disaggregated resource represented by logical volume 500; however, the naming structure only partially describes the role played by each resource node. For example, resource node 532 represents mirrored data as indicated by the portion of the name "M2;" however, there is no indication that it is also a stripe. In the case of resource node 534, its name carries sufficient information to describe the node's role. One should note Table 1 presents one contemplated naming scheme. All other naming schemes that provide structure are contemplated. In a preferred embodiment, each resource node name reflects its initial membership of a group in a hierarchical fashion based on group type. Using only initial membership information allows for cases where groups are extended after first deployment without requiring the resource node names to be updated reducing management overhead and reducing the communication necessary with individual resource nodes. For example, when logical volume 500 is being built, root node 510 and resource node 522 are established with names "Z" and "Z.M1" respectively to create two mirrors of each other. As additional groups and nodes are added, including striped nodes, to add further functionality it is preferable to leave established names as they are rather than renaming nodes to reflect modifications without having to rename all nodes which would require intensive bookkeeping.
Through structured naming, resource consumers determine the overall structure of a disaggregated resource and identify gaps in the structure. For example, if resource node 524 does not respond to a discovery request by a resource consumer, the resource consumer understands there is a gap between resource node 522 with name "Z.M1" and resource node 526 with name "Z.M1.R2." Therefore, the resource consumer has sufficient information to deteπnine that mirror group 520 (as serial group of striped resource nodes) is incomplete and can not be used. However, given a structured naming scheme, a resource consumer can not know the last member in a group because it is always possible that the last resource node might not have responded. For example, if resource node 526 of mirror group 520 does not respond to a discovery request, the resource consumer would believe that resource node 524 is the last member of mirror group 520 and would not know that group is incomplete. It is contemplated that resource node names carry complete information regarding group memberships so that resource consumers can make a full determination of disaggregated resource coherency; however, such naming is inefficient because names can become long, disorganized, or unmanageable. It is also contemplated that resource node names have additional characteristics including being discoverable via discovery request message, or comprise information regarding resource node responsibilities.
Resource node names form two different classes: those that provide resource node role information and those that do not. Resource node role information helps resource consumers determine how to properly interact with individual resource nodes. For example in the case of a mirror group 520, resource nodes 522, 524, and 526 are striped nodes. A resource consumer can infer through the names of resource nodes 522, 524, and 526 are each stripes. Therefore, based on knowledge of how stripes operate (stripe depth, LBA ranges, module arithmetic, or other information), a resource consumer can determine which resource node is responsible for individual LBAs in a logical volume. Consequently, the resource consumer performs LBA calculations locally to determine which resource node is responsible for an LBA, then directly interacts with an individual resource node. In a preferred embodiment, the interaction is through a unicast message directed from the resource consumer to the address of a resource node. If resource nodes handle LBA calculations, then resource node names only have to carry organization information and do not have to carry resource node role information. In this case, a resource consumer directly interacts with the entire group collectively and lets each member of a group determine if any action is required on their part. For example, a resource consumer interacting with mirror group 520 sends a single message comprising an LBA to resource nodes 522, 524, and 526 collectively. It a preferred embodiment, the interaction is through a multicast message directed from the resource consumer to the multicast address of the group. Given this case, developers can use any naming or tagging system that results in a discemable structure for a disaggregated resource and is understandable by a resource consumer.
Resource Node Attributes
Resource node attributes contribute to determining the role of a resource node, to determining the coherency of the disaggregated resource, or to helping a resource consumer to properly interact with a disaggregated resource.
Resource consumers use resource node attributes to determine if a serial group of resource nodes is complete or has coherency. It is contemplated that resource nodes have coherency attributes where each attribute is associated with a group type. For example, in a storage array representing a disaggregated resource with mirror groups, stripe groups, and span groups, coherency attributes could include a mirror attribute, a stripe attribute, and span attribute, hi a preferred embodiment, each coherency attribute indicates that there are additional members, and therefore, nodes, associated with a particular group. Table 2 illustrates how coherency attributes contribute to a resource consumer's understanding of a disaggregated resource structure and the roles each node plays in the disaggregated resource based on logical volume 500 example presented in Figure 5B. In Table 2, the "M" attribute indicates that there is an additional mirror group; the "R" attribute indicates that there is an additional stripe member, and the "S" indicates there is an additional span member. Even though the attributes are set on the individual nodes, the attributes are associated with the groups because groups can comprise other groups that further comprise multiple resource nodes, for example mirror group 530 comprising span group 535 which functions as a stripe.
-99-
Figure imgf000024_0001
Table 2
A resource consumer interacting with logical volume 500 with attributes has sufficient information to determine if the serial groups comprising logical volume 500 have coherency and are complete. For example, the resource consumer can determine the completeness of mirror group 520, a serial group comprising striped resource nodes 522, 524, and 526, by checking the attributes even if a resource node is not visible. If resource node 526 is not visible to the resource consumer, then the resource consumer can determine that there are additional expected groups based on the "R" attribute being set on resource node 524. Furthermore, if all members of mirror group 520 are visible to the resource consumer, then the resource consumer can determine that there are no additional members because the "R" attribute is not set on resource node 526 indicating there are no additional stripes and, therefore, can determine that the group is complete.
The attributes also contribute information that allows a resource consumer to infer what role a resource node plays in the disaggregated resource. For example, resource node 522 has the name "Z.M1" which partially indicates its role, that it is a mirror. However, the name does not contain information regarding its role as a stripe. The resource consumer can infer that resource node 522 is stripe by detecting that the "R" attribute is set indicating that an additional stripe is expected implying resource node 522 is also a stripe.
Maintaining coherency of parallel groups requires more complex attributes because if one member of the parallel group is not visible, other resource nodes that are redundant can supply the resource in place of the lost group member. Coherency is important when multiple resource consumers access a shared disaggregated resource where the physical resource has state from the perspective of the resource consumers. Due to a number of reasons including unreliable networks, a resource consumer could loose track of a resource node in a parallel structure; consequently, that node could become out of synch with other resource nodes even though the complete disaggregated resource still provides functionality from the perspective of the resource consumer due to redundancy. For example, logical volume 500 is a parallel group comprising three mirror groups, root node 510, mirror group 520 (a serial group of striped resource nodes), and mirror group 530 (a serial group of striped and spanned resource nodes) all comprising identical data sets. If a first resource consumer can see only mirror groups 520 and 530, but not 510, then it could properly interact with the two visible groups. However, if a second resource consumer can see only mirror group 510, but not 530 and 520, then the second resource consumer can properly interact with group 510. Unfortunately, data on logical volume 500 is at risk if both resource consumers write to the volume because the volume no longer has coherency resulting in the groups losing synchronization. Coherency attributes can include a count of the number of groups or resource nodes comprising a parallel group. In the case of logical volume 500, there are three parallel groups supplying redundant data; therefore, the member count is three. Furthermore, a resource consumer, upon detection that a group or node is missing, can change a valid count on the visible resource nodes to indicate the how many groups are valid. For example, in the case of the first resource consumer seeing only mirror group 520 and 530 would set a valid count to two (as opposed to three). Any other resource consumer would see a discrepancy between the valid count (2) and member count (3) and know the disaggregated resource has lost coherency.
In a preferred embodiment, resource nodes have both a resource valid attribute indicating how many members of a group are valid and a resource member attribute indicating the number of expected members in the disaggregated resource. In a more preferred embodiment, resource nodes comprise a key to ensure that only one resource consumer is able to modify resource attributes at a time, or to indicate when group member resource node attributes are modified. Contemplated keys include random numbers, sequential counts, UUID, or GUIDs. Additionally, in a preferred embodiment resource consumers periodically check the coherency of a disaggregated resource to ensure it is able to services resource requests. In a preferred embodiment, through an advantageous selection of rules for creating an organizational structure of resource nodes, only one attribute is required for each group type rather than an attribute for each created group.
Additional resource node attributes are also contemplated. Specifically, attributes that indicate whether resource nodes are owned or share provide utility when multiple resource consumers interact with a disaggregated resource simultaneously.
Other Resource Node Information
Beyond group types, resource node names, and resource attributes, additional resource node information is contemplated including resource node capabilities, physical resource information, detailed resource node resource map information, or other information that enhances the operation of a disaggregated resource with respect to a resource consumer. Examples of resource node capabilities within storage array include block size, stripe size, logical partition capacity, or LBA responsibility information. Examples of resource node capabilities within a disaggregated video display include logical video frame size, pixel color depth, decoding information, or aspect ratio. All information regarding a resource node is contemplated.
Resource maps
Resource maps provide a translation from a resource element ID, an LBA or logical pixel coordinate for example, to a physical address, a hard disk storage block or a pixel on a video monitor for example. Within this context "physical address" means an address that is directed toward a physical resource; therefore a physical address includes intermediary addresses that are further directed toward the physical resource. Resource maps, especially those located within a resource node determine a resource node's responsibility.
Resource maps are either static or dynamic. Static resource maps are stored in a memory and contain sufficient information for translation. Dynamic resource maps are also stored in a memory and are built based on resource node information. Resource consumers build a dynamic map, if necessary, and determine if the map is complete through the resource node information. Figure 6 depicts an example of a resource map. Resource consumers access a disaggregated resource through the use of resource map 600 by referencing elements of the disaggregated resource via resource element IDs 610A through 610N. Resource map 600 includes a translation from resource element IDs 610A through.610N to physical addresses 620A through 620N. A resource map comprises a combination of software or data structures to achieve the goal of translation. It is contemplated that a resource map could comprise a table; however, the map would be an inefficient use of memory due to the number of resource elements IDs that have to be handled. In a storage array a typical number of IDs comprise 248 LBAs. In a video display, the number of pixels could easily be in the millions. In a preferred embodiment, resource map 600 comprises a software function that translates resource element IDs 610A through 610N to physical addresses 620A through 620N. Consequently, resource maps provide a translation from a virtual coordinate system used to represent the disaggregated resource to a physical coordinate system across physical resources. A resource consumer collects resource node information from resource nodes that compose a disaggregated resource. The resource consumer then constructs a dynamic resource map in memory based on resource node names, resource node attributes, resource node addresses, or group types. If the resource consumer has an understanding of the roles of each resource node, then the resource consumer can have an understanding of how resource element ID 610A through 610N are distributed across all the resource nodes. In the example of a disaggregated storage array where data is striped across a number of resource nodes, a resource map takes the form of software function employing modulo arithmetic to determine which resource node is responsible for an LBA. The physical address in this case would be the address of the node responsible for the LBA. In a preferred embodiment, a resource element ID translates to one or more physical address in order to support parallel groups including mirrors.
The complete map of a disaggregated resource splits into a number of smaller maps; each of them also a resource map. Resource consumers need sufficient information to create a map that allows it to interact with the disaggregated resource. Resource nodes have a fine grained mapping of a physical resource which is often a static map.
Figure 7 depicts an example of a split resource map of disaggregated storage array.
Split resource map 700 comprises resource sub-map 725 stored in first memory 720 and resource sub-map 735 stored in second memory 730. Resource sub-map 725 translates LBAs 726A through 726N to resource node addresses 728A through 728N. Resource sub-map 735 represents an example static resource map of a resource node addressed in resource sub-map 725. Resource sub-map further translates LBAs 736A through 736M to physical addresses 738 A through 738M representing a physical location on a disk drive. Resource sub-map 725 and resource sub-map 735 do not have to have the same number of elements because a resource node's resource map generally represents a fraction of the complete disaggregated resource. A resource consumer interacting with the example storage array uses resource sub- map 725 to find an address of a resource node. Resource consumer sends requests to a specific resource node, if required. The specific resource node could then use a resource map similar to example resource sub-map 735 to interact with a disk. Tn a preferred embodiment, resource maps and resource sub-maps employ software functions for translation. In an especially preferred embodiment, the software functions are used to yield resource node responsibilities including striping, parity, spanning, mirroring, or composite responsibilities.
Each resource map in a disaggregated resource comprises a level of granularity. Resource consumers often utilize coarse grain resource maps with a granularity at the resource node level. The resource node level corresponds to a granularity at physical partition level that is at or below a physical resource level. For example, in a storage array, a resource node represents a logical disk partition that corresponds to a disk partition that is smaller than a disk drive. In addition, in a video monitor, a resource node represents a logical video frame that corresponds to a frame that is smaller than the video monitor.
Figure 8 depicts several resource maps with various levels of granularity. Physical resource 840 comprises physical resource partition 843 and 845. A resource node is responsible for providing access to physical resource 840, and more particularly to resource partition 843. Resource map 810 has a granularity at the resource partition level because it translates resource element IDs to physical addresses that correspond to a physical resource partition level. In a preferred embodiment, the physical address of a physical resource partition corresponds to an address of a resource node. Physical resource partition 840 further comprises physical resource elements 844A through 844N. Resource map 820 has a granularity at the resource element level that is below the physical resource partition level because it can address individual physical resource elements 844 A through 844N.
Contemplated resource elements include addressable blocks on a disk drive. Resource nodes typically employ resource maps that have granularity at the physical resource element level. It is contemplated that resource map can have a granularity below the physical resource element level. Physical resource elements 844A through 844N comprise smaller pieces as depicted by atom 845. For example, disk drives have addressable blocks where each block comprises a number of bytes and each byte comprises a number of bits. Resource map 830 has a granularity at the atomic level. It is contemplated that resource nodes that employ resource maps having an atomic level granularity or below use multiple operations to access the physical resource. In the case of a storage array, a resource node would use a read- modify-write operation to change a byte within a block.
Accessing Disaggregated Resources
Figure 9 depicts a possible set of steps by which entities can interact with disaggregated resources.
At step 905, a manager creates a disaggregated resource. The manager responds to a need for the disaggregated resource and creates two or more resources nodes that have responsibility for physical resources in order to attempt to fulfill the need. For example, an IT manager establishes that an enterprise requires several terabytes of disk storage comprising mirrored data. Then, through utilities, the manager creates a number of logical disk partitions (resource nodes) representing partitions on a set of disk drives where the collection of logical disk partitions substantially fulfills the capacity and mirroring requirements. In a preferred embodiment, the manager uses a one or more software utilities designed to communicate with resource consumers, resource nodes, or modules that connect to physical resources. Given this, it is contemplated that a manager includes an administrator, a software package used to manage disaggregated resources, management servers, or management appliances. It is further contemplated that a manger includes a resource consumer, or a computer system external to the resource consumer and disaggregated resource.
At step 910, the manager establishes an organizational structure for the resource nodes. The organization structure provides definitions for the roles and responsibilities of each resource node. Once the roles and responsibilities are established, a manager creates a corresponding set of resource node information and resource maps for deployment into the resource nodes. Resource consumers use knowledge of the rules for forming the organizational structure in order to convert resource node information in a useable map of the disaggregated resource or determine if the disaggregated resource has coherency. In a preferred embodiment, the organizational structure comprises a hierarchy based on group types. In an especially preferred embodiment, the hierarchy groups include mirrors, stripes, composites, or spans. Other contemplated hierarchical structures include trees, or directed graphs. Non-hierarchical structures are also contemplated including flat sequential lists with no preferred ordering of resource nodes.
At step 915, a manager assigns resource node information and resource maps to each of the resource nodes composing the newly created disaggregated resource. Each resource node receives the resource node information and the resource map information then stores the information in a memory. The resource node information provides resource consumers information regarding what role the resource node takes on in the disaggregated resource. The resource map determines the responsibility of the resource node. In a preferred embodiment, resource node information includes group types, resource name, resource node name, or resource node attributes. Once the resource nodes composing the disaggregated resource have their resource node information and resource maps initialized, resource consumes can being accessing the disaggregated resource.
At step 920, a resource consumer wishes to access the disaggregated resource. If the resource consumer is not aware of the disaggregated resource, the resource consumer sends a discovery message to the disaggregated resource. Possible discovery methods include reading resource node from a local information store, sending broadcast message to resource nodes, using standardized protocols including SSDP, requesting resource node information from a server external to the disaggregated resource, or other methods that result in delivering resource node information to the resource consumer. In a preferred embodiment, discovery includes identifying the physical modules that comprise resource nodes and sending name resolution requests with possible regular expression information to all resource nodes. If a resource node receives the request and its name matches the regular expression, it will respond. Contemplated discovery requests include those sent via unicast to individual resource nodes, via multicast sent to groups of resource nodes, or via broadcasts sent to one or more groups of resource nodes.
At step 925, each resource node receives discovery messages sent by resource consumers. Upon reception and verification of authorization, if any authorization is required, of the discovery message, each individual resource node retrieves its own resource node information from its memory. The resource node information is formed in to a response message which is sent back to the resource consumer at step 930. The resource consumer attempts to collect responses from all responding resource nodes. It is contemplated that not all resource nodes will respond. In a preferred embodiment, resource consumers have the ability to cycle through multiple discovery attempts to capture as much resource node information as possible given an unreliable network.
At step 935, the resource consumer collects resource node information from responding nodes and constructs a resource map of the disaggregated resource. The resource consumer has an understanding of the rules of the organizational structure used to create the resource node information. Based on the rules and on the resource node information the resource consumer can determine if the constructed resource map is sufficiently complete or if the disaggregated resource has coherency. If the resource map is sufficiently complete, then the disaggregated resource is usable by the resource consumer. A sufficiently complete resource map is still considered complete as long as some members of a parallel group are visible. For example, if a storage array comprises a number of mirrored groups, each being a member of a parallel group, then one mirror could disappear from view, but the storage array would still be useful. If coherency is lost, the disaggregated resource might not be useable as is the case when coherency is lost in a serial group. At step 940, the resource consumer assesses the constructed resource map and at step 942 it decides if the disaggregated resource has coherency. If the disaggregated resource has coherency, then the resource consumer can begin using the disaggregated resource at step 950. If the disaggregated resource does not have coherency, then the resource consumer determines if the resource map has sufficient group members to still be useable. In a preferred embodiment, the resource consumer checks resource node names, group type, or resource node attributes to check for coherency.
Additionally, a preferred embodiment sorts the list of responding resource nodes, or removes duplicates, if any.
At step 944, the resource consumer decides if the remaining group members are able to provide sufficient capabilities to service requests from the resource consumer. This is possible in the case where there are sufficient parallel groups, or redundant resources. If there are sufficient capabilities remaining, the resource consumer will modify the resource node attributes at step 945 to indicate a change in coherency to be detected by other resource consumers. In a preferred embodiment, a resource consumer modifies coherency attributes including a resource node valid attribute. If the disaggregated resource is not usable, the resource consumer optionally reports an error or optionally attempts the discovery process again at step 920 to pick up lost members. It is contemplated that a resource consumer could probe for specific lost members based on detecting gaps within the map structure and on sending regular expression discovery messages tailored to check for names the could appear in the gaps. After modifying resource node attributes at step 945, the resource consumer can begin using the disaggregated resource at step 950.
At step 950, a resource consumer interacts with the disaggregated resource by sending messages to the disaggregated resource, hi a preferred embodiment, sending messages comprises sending packets over a packet switched network to all the resource nodes, to groups of resource nodes, or to individual resource nodes that compose the disaggregated resource. In an especially preferred embodiment, communications between the resource consumer and disaggregated resource take the form of UDP packets that are broadcast, multicast, or unicast.
In a preferred embodiment a resource consumer employs optimization techniques when communicating with resource nodes over an unreliable communication path. Contemplated optimizations techniques including employing a slow start or congestion avoidance similar to TCP to ensure the communication path is solid. One factor that contributes to a communication path being unreliable is the physical nature of the infrastructure. For example, hubs, switches, or routers form a packet switched network that could represent an unreliable communication path. Not all networking equipment is the same nor does the equipment faithfully implement protocols or standards. Consequently, in an especially preferred embodiment where resource consumers use UDP to communicate with resource slow start or congestion avoidance ensures networking equipment is not overwhelmed due to speed of communications.
At step 955, all resource nodes that are capable of receiving messages from a resource consumer, accept the messages. Because resource consumers can broadcast a single message to a group of resource nodes, one or more resource nodes within a group can receive the message. Each individual resource node determines if the message requires their individual attention based on their role or responsibility as set forth by their resource node information or resource maps. For example, a message from a resource consumer can include a resource element ID which the resource node compares against its resource map. If a resource node determines that is should not take any action, the message is silently discarded. If the resource node determines it should take action on the message, it proceeds to analyze the message.
At step 956 a resource node determines if the resource consumer message is a request for a resource for which the resource node is responsible. If the message is a non-resource specific message, then message is handled appropriately at step 985 and a response is generated and sent if necessary, and resource consumers processes the response at step 990. Non-resource request messages include configuration messages, management messages, firmware updates, or other messages that are of interest to the resource node beyond a request for the resource. If the message is a resource request, the message is further analyzed.
At step 960 in a preferred embodiment a resource node bypasses a communication stack to deliver the message directly to the main functionality of the resource node in order to improve over all performance of the resource node. It is contemplated that a resource node comprises a TCP/IP protocol stack that supports UDP packets. When a packet is accepted by the resource node at an Ethernet level, the resource node checks the packet and delivers the packet data directly the resource node application rather than requiring full processing of the TCP/IP stack eliminating superfluous computation. By bypassing the communication stack, costs are reduced because inexpensive processing units deliver more desirable performance than a more expensive, powerful unit that uses the full TCP/IP stack.
At step 962 the resource node checks the data in the resource request message to ensure that the resource node is able to handle the request or to determine if the request is an access fault. In a preferred embodiment resource request messages include one or more referenced (directly or indirectly) resource element IDs. The resource node checks its responsibilities to determine if it is able to handle the referenced resource element IDs. If some of the resource element IDs fall outside the scope of the resource node's responsibilities, then an access fault has occurred. The resource node reports the access fault back to the resource consumer and the resource consumer processes the response at step 990. In a preferred embodiment, a access fault shows a resource consumer that it is attempting to bridge resource nodes. For example, if two resource nodes form a span group in a storage array, when a resource consumer attempts write data for multiple LBAs, a bulk write, within a single message and the LBAs bridge from the first resource node to the second, the first resource node will generate an access fault. The resource consumer can then break the message up into individual requests each having a single LBA resulting in an aligned write to the resource nodes. If the message is not an access fault, the resource node continues its handling of the resource request.
At step 965 the resource node fully processes the resource request. The full processing depends on the type of resource for which the resource node is responsible. Contemplated processing includes, I/O processes, checking status, resource management, resource node information management, resource map management, or other processing as determined by the resource node's responsibilities or a protocol designed for use with the resource node.
In a preferred embodiment, it is contemplated that one type of processing that a resource node conducts is the processing of a proxy command at step 966. A proxy command instructs a resource node to communicate with another resource node as if the first resource node were a resource consumer. The first resource node then relays the message to the second resource node which is waiting at step 955.
At step 975 if the resource request is not a proxy command, the resource node continues processing the resource request and generates a response sent back to the resource consumer. The resource consumer receives the response at step 990.
It is contemplated that resource consumers and resource nodes communicate over an unreliable communication path that could result in loss of coherency of the disaggregated resource. Furthermore, multiple independent resource consumers access disaggregated resources simultaneously, each resource consumer having its own view of the disaggregated resource which could be out of synch with the other resource consumers. Therefore, in a preferred embodiment a resource consumer will periodically check for disaggregated resource coherency. At step 992 the resource consumer checks to see if coherency requires checking. If so, the resource consumer returns to step 920 and initiates a discovery. The periodicity of checking for coherency or conditions when coherency should be checked varies depending on the resource consumer and disaggregated resource environment. In a preferred embodiment, periodicities are greater than or equal to one second. Example Disaggregated Resources
The following examples illustrate how disaggregated resources can be employed to provide accessible resources for computer systems. Although a limited number of examples are presented, there are no implied restrictions or no implied limitations.
Example - Disaggregated Storage Array
An example of a disaggregated resource includes a data storage array comprising a plurality of hard disk drives. The storage array is virtualized by two or more resource nodes represented by logical disk partitions. Each logical disk partition has responsibility for a partition on a disk and comprises an IP address used to communicate with resource consumers. A logical disk partition also comprises a resource map that translates LBAs to physical disk locations used during I/O operations. The collection of logical disk partitions appears as a locally connect, logical volume to computer systems that represent resource consumers.
When the storage array is built, the logical disk partitions are created and given names that reflect their position and role in the group hierarchy of the logical volume. The group hierarchy includes groupings based on mirrors, stripes, or spans. The storage array represents itself as a single logical volume with a name. Each logical disk partition incorporates the logical volume name and group type information within the name of the logical disk partition. Furthermore, each logical disk partition also includes attributes indicating if there are more logical disk partitions in each type of group.
A resource consumer in the form of a software driver installs on a workstation beneath an operating system or file system. The low level placement of the driver allows existing software and applications to utilize the storage array without requiring modification. As software accesses the logical volume, the driver encapsulates FO requests into a packets sent to the logical volume. In a preferred embodiment, the requests are sent to the logical volume collectively through a multicast message where the logical partitions composing the logical volume are all members of the multicast group. It is contemplated that the driver communicates with the logical volume through unicast messages as well.
The driver sends a discovery request to the logical volume and the logical disk partitions respond with their individual information including their names or attributes. Once the driver collects the resource node information, it builds the initial coarse grained map of the logical volume where the map comprises the roles each logical partition plays and the addresses of the logical partitions. Once the map is assembled and determined to be complete, the driver begins communicating directly with logical volume. If the map is incomplete as determined based on the names and attributes, the driver attempts to find missing elements or possibly generates errors. If the logical volume comprises mirrors, the driver is able to determine that a group of partitions could have lost coherency with the rest of the data set by detecting changes in the coherency attributes including the resource node valid attribute.
The driver sends read and write requests to the logical volume or to individual logical partitions. As the logical disk partitions receive the requests, each logical disk partition checks the LBAs of the requests and compare the LBAs to the logical disk partition's map of the disk partition to determine if action is required. If no action is required, the logical disk partition silently discards the packet; otherwise, the packet is processed.
In a preferred embodiment, logical disk partitions employ several optimizations.
Packets from the driver can bypass the logical disk partition's communication stack to improve performance. In addition, packets are checked for access faults and the faults are reported back to the driver. The driver interprets the access fault as instructions to break requests up so individual logical disk partitions process the requests naturally without requiring logical disk partitions to communicate with each. Other interpretations of access faults are also contemplated in order to properly handle or report error conditions.
It is specifically contemplated that the data storage array uses modules attached to disk where the modules provide the logical disk partition functionality. Modules include rack mount enclosures that house 12 or 16 disks, desktop enclosures housing one to eight disks, single adapters that plug onto disks, or embedded modules integrated with other products.
Example - Disaggregated Video Display
Yet another example of a disaggregated resource includes a video display comprising a plurality of monitors. The video display is virtualized by two or more resource nodes represented by logical video frames where each logical video frame has an address and an extent of pixels, mostly likely in an (x,y) coordinate system. A logical video frame comprises a map that translates a virtual pixel address into a physical pixel address on one of the monitors. The collection of logical video frames appears as a locally connected video display from the perspective of an electrical device. Contemplated electrical devices include computers, TVs, video games, ATM machines, or other devices requiring a display.
The groups employed by the video display including composites groups where multiple logical video frames combine to form an aggregate display or mirror groups where the same video information is displayed more than once. It is contemplated a mirror includes a video recorder. The logical video frames also have names associated with their group types and the name of the video display. Coherency checks for the video display are useful to determine if a monitor has failed.
Just as in the storage example, a resource consumer for the video display can take the form of a driver. Furthermore, it is contemplated that a module comprising hardware, software, or firmware plugs into a video out of an electrical device and communicates with the video display transparently to the electrical device. For video streams, it is contemplated that image decoding can occur within the resource consumer, a third system, or within the logical video frame. Image decoding is required to determine which pixels of the image display should be mapped to which logical video frame and to provide for proper scaling for pixels.
Alternative Embodiments
In addition to the preceding examples alternative embodiments are also contemplated.
Alternative embodiments include other computing related resources including processors where the processor bandwidth is treated as a resource, memory where memory is segmented into partitions similar to a storage array, networking interfaces where QoS or bandwidth is treated as a resource, power supplies, audio I/O, input devices, or others. Alternative embodiments also include non-computing resources including products supplied through a distribution system where a number of warehouses represent resource nodes.
It is also contemplated that more than one disaggregated resource can combine to form a larger heterogeneous resource. For example, several disaggregated resources including a storage array, video display, or processing array combine to form a disaggregated computer. Hardware
In yet another aspect, it is contemplated that one could create hardware designed to adapt devices to allow the devices to function as an independent resource node, or in a more preferred embodiment, to function as multiple independent resource nodes where each resource node is responsible for a logical partition of the device. Therefore, the inventive subject matter includes hardware or firmware of such adapters as well as licensing, selling, advertising, managing, distributing, or operating the adapters.
Software
In still another aspect, it is contemplated that one could write software that would configure, simulate, or manage disaggregated resources and their associated infrastructure. From that perspective the inventive subject matter includes methods of writing such software, recording the software on a machine readable form, licensing, selling, distributing, installing, or operating such software on suitable hardware. Moreover, the software per se is deemed to fall within the scope of the inventive subject matter.
Advantages
Disaggregated resources provide a number of advantages. Individual resource nodes composing the disaggregated resource function independently of all other resource nodes which increases responsiveness or performance from the perspective of a resource consumer without requiring out-of-band communication. Each resource node provides partial information regarding the over map of the disaggregated resource allowing each resource consumer that requires access to the disaggregated resource to build its own effective map. This allows two resource consumers to share the same physical resources, or even the same logical resource, but access the disaggregated resources differently as required by their individual needs. For example, a first resource consumer could access one mirror group preferentially over another mirror group to reduce conflicts with the second resource consumer. Resource consumers are able to discover and access the disaggregated resource without working through an extraneous system that would incur cost overhead to a consumer. Finally disaggregated resources place resource map information with required granularity where it is necessary to ensure efficient access. Furthermore, the disaggregated resource can comprise resource maps with very fine levels of granularity offering a high degree of control over physical resources.
Thus, specific compositions and methods of disaggregated resources have been disclosed. It should be apparent, however, to those skilled in the art that many more modifications besides those already described are possible without departing from the inventive concepts herein. The inventive subject matter, therefore, is not to be restricted except in the spirit of the disclosure. Moreover, in interpreting the disclosure all terms should be interpreted in the broadest possible manner consistent with the context. In particular the terms "comprises" and "comprising" should be interpreted as referring to the elements, components, or steps in a non-exclusive manner, indicating that the referenced elements, components, or steps can be present, or utilized, or combined with other elements, components, or steps that are not expressly referenced.

Claims

CLAIMSWhat is claimed is:
1. A disaggregated resource spread among a least a first and second resource node, wherein the first and second resource node are distinguishable from each other and are both responsive to a resource discovery request by providing resource node information from which a resource consumer creates a resource map in a first memory, wherein the resource node information stored in a second memory from an individual one of the first and second resource node is incomplete with respect to the complete resource map.
2. The resource of claim 1, wherein the resource node information further comprises a resource node name.
3. The resource of claim 2, wherein the resource node name is a unique name.
4. The resource of claim 2, wherein the resource node name resolves to an internetworking protocol address.
5. The resource of claim 4, wherein the internetworking protocol address is a multicast address.
6. The resource of claim 1, wherein the resource node information further comprises a resource node group type.
7. The resource of claim 6, wherein the resource node group type includes at least one of a mirror, stripe, span, parity, composite, archive, or individual type.
8. The resource of claim 1, wherein the resource node information further comprises a resource node attribute.
9. The resource of claim 8, wherein the resource node attribute is a resource valid attribute
10. The resource of claim 8, wherein the resource node attribute is a resource member attribute.
11. The resource of claim 1 , wherein the resource map comprises a granularity at a resource partition level that is below the resource level.
12. The resource of claim 11 , wherein the resource map further comprises a granularity at a resource element level that is below the resource partition level.
13. The resource of claim 1, wherein the resource map further comprises split resource map comprising a first resource sub-map stored in first memory and a second resource sub-map stored in a second memory.
14. The resource of claim 1, wherein the resource map created by the resource consumer is different than a second resource map created by a second resource consumer.
15. The resource of claim 1, wherein the resource further comprises a storage medium.
16. The resource of claim 1, wherein the resource further comprises a video display.
17. The resource of claim 1, wherein the resource further comprises a processing unit.
IS. The resource of claim 1 , wherein the resource further comprises a memory.
19. The resource of claim 1, wherein the resource consumer and at least one of the first and second resource nodes communicate with each other over an unreliable communication path.
20. A method of accessing a disaggregated resource spread among at least a first and second resource node, wherein the first and second resource node are distinguishable and are both responsive to requests from a resource consumer by providing their individual resource node information stored in a first memory, said method comprising:
(a) Organizing the at least first and second resource nodes into an organizational structure that is understandable by the resource consumer;
(b) Assigning the resource node information to the at least first and second resource node wherein the resource node information is based on the organizational structure; (c) Assessing disaggregated resource coherency based on the resource node information from the at least first and second resource node; and
(d) Sending resource requests to the disaggregated resource.
21. The method of claim 20, wherein the resource node information comprises resource node group type.
22. The method of claim 21, wherein the resource node group type includes at least one of the following group types mirror, stripe, span, parity, composite, archive, and individual type
23. The method of claim 20, wherein the resource node information comprise a resource node attribute.
24. The method of claim 23, wherein the resource node attribute comprise a resource member attribute.
25. The method of claim 23, wherein the resource node attribute comprise a resource valid attribute.
26. The method of claim 25, further comprising modifying the resource valid attribute upon detection of loss of disaggregated resource coherency.
27. The method of claim 20, wherein the resource node information comprises a resource node name.
28. The method of claim 27, wherein the resource node name comprises a name associated with the disaggregated resource.
29. The method of claim 20 further comprising discovering the disaggregated resource by sending a discovery request to the disaggregated resource and collecting responses that comprise the resource node information from at least one of the at least first and second resource node.
30. The method of claim 29, further comprising creating a resource map in a second memory of the disaggregated resource based on the resource node information.
31. The method of claim 20, further comprising periodically assessing the coherency of the disaggregated resource.
32. The method of claim 31, wherein the period of checking occurs at a frequency of at least 10 seconds.
33. The method of claim 20, further comprising bypassing a communication stack of the at least first and second resource node to facilitate responsiveness to the requests.
34. The method of claim 20, further comprising generating access faults in response to the requests.
35. The method of claim 20, further comprising sending a proxy command from the first resource node to the second resource node.
36. A split resource map of a disaggregated resource spread among at least a first and second resource node, said map comprising:
(a) a first resource sub-map stored in a first memory, and
(b) a second resource sub-map stored in a second memory.
37. The resource map of claim 36, wherein the first resource sub-map comprises an address directed to the memory storing the second resource sub-map.
38. The resource map of claim 36, wherein the resource map comprises a granularity at a physical resource partition level that is below the physical resource level.
39. The resource map of claim 38, wherein the resource map further comprises a granularity at a physical resource element level that is below the physical resource partition level.
PCT/US2005/036022 2005-08-16 2005-10-06 Disaggregated resources and access methods WO2007021289A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
US11/205,895 US8819092B2 (en) 2005-08-16 2005-08-16 Disaggregated resources and access methods
US11/205,895 2005-08-16

Publications (1)

Publication Number Publication Date
WO2007021289A1 true WO2007021289A1 (en) 2007-02-22

Family

ID=35709035

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/US2005/036022 WO2007021289A1 (en) 2005-08-16 2005-10-06 Disaggregated resources and access methods

Country Status (2)

Country Link
US (3) US8819092B2 (en)
WO (1) WO2007021289A1 (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7636758B1 (en) 2008-07-10 2009-12-22 Gene Fein Advertisement forwarding storage and retrieval network
USRE47411E1 (en) 2005-08-16 2019-05-28 Rateze Remote Mgmt. L.L.C. Disaggregated resources and access methods

Families Citing this family (35)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8005918B2 (en) 2002-11-12 2011-08-23 Rateze Remote Mgmt. L.L.C. Data storage devices having IP capable partitions
US7620981B2 (en) 2005-05-26 2009-11-17 Charles William Frank Virtual devices and virtual bus tunnels, modules and methods
US8266634B2 (en) * 2007-10-12 2012-09-11 Microsoft Corporation Resource assignment system with recovery notification
US8707318B2 (en) 2007-10-12 2014-04-22 Microsoft Corporation Partitioning system including a generic partitioning manager for partitioning resources
KR101760333B1 (en) * 2011-03-02 2017-07-21 삼성전자주식회사 Communication method of target terminal and access point for group id management in mu-mimo transmission
US9069467B2 (en) * 2011-06-01 2015-06-30 Microsoft Technology Licensing, Llc Isolation of virtual machine I/O in multi-disk hosts
US10142411B2 (en) 2015-05-29 2018-11-27 Microsoft Technology Licensing, Llc Dynamic swarm segmentation
US9529538B1 (en) 2015-06-18 2016-12-27 International Business Machines Corporation Virtual accounting container for supporting small volumes of data
US9977613B2 (en) * 2015-12-30 2018-05-22 Dell Products L.P. Systems and methods for zone page allocation for shingled media recording disks
US10235198B2 (en) * 2016-02-24 2019-03-19 Samsung Electronics Co., Ltd. VM-aware FTL design for SR-IOV NVME SSD
US9961139B2 (en) * 2016-05-24 2018-05-01 International Business Machines Corporation Cooperative download among low-end devices under resource constrained environment
US10423487B2 (en) 2016-08-19 2019-09-24 Samsung Electronics Co., Ltd. Data protection offloads using SSD peering
US11138146B2 (en) * 2016-10-05 2021-10-05 Bamboo Systems Group Limited Hyperscale architecture
US11153164B2 (en) 2017-01-04 2021-10-19 International Business Machines Corporation Live, in-line hardware component upgrades in disaggregated systems
US10534598B2 (en) * 2017-01-04 2020-01-14 International Business Machines Corporation Rolling upgrades in disaggregated systems
US10768844B2 (en) 2018-05-15 2020-09-08 International Business Machines Corporation Internal striping inside a single device
US11650849B2 (en) 2018-09-25 2023-05-16 International Business Machines Corporation Efficient component communication through accelerator switching in disaggregated datacenters
US11163713B2 (en) 2018-09-25 2021-11-02 International Business Machines Corporation Efficient component communication through protocol switching in disaggregated datacenters
US11182322B2 (en) 2018-09-25 2021-11-23 International Business Machines Corporation Efficient component communication through resource rewiring in disaggregated datacenters
US10802988B2 (en) * 2018-09-25 2020-10-13 International Business Machines Corporation Dynamic memory-based communication in disaggregated datacenters
US10915493B2 (en) 2018-09-25 2021-02-09 International Business Machines Corporation Component building blocks and optimized compositions thereof in disaggregated datacenters
US10637733B2 (en) 2018-09-25 2020-04-28 International Business Machines Corporation Dynamic grouping and repurposing of general purpose links in disaggregated datacenters
US11012423B2 (en) 2018-09-25 2021-05-18 International Business Machines Corporation Maximizing resource utilization through efficient component communication in disaggregated datacenters
US10671557B2 (en) 2018-09-25 2020-06-02 International Business Machines Corporation Dynamic component communication using general purpose links between respectively pooled together of like typed devices in disaggregated datacenters
US10831698B2 (en) 2018-09-25 2020-11-10 International Business Machines Corporation Maximizing high link bandwidth utilization through efficient component communication in disaggregated datacenters
US10754720B2 (en) 2018-09-26 2020-08-25 International Business Machines Corporation Health check diagnostics of resources by instantiating workloads in disaggregated data centers
US10831580B2 (en) 2018-09-26 2020-11-10 International Business Machines Corporation Diagnostic health checking and replacement of resources in disaggregated data centers
US11050637B2 (en) 2018-09-26 2021-06-29 International Business Machines Corporation Resource lifecycle optimization in disaggregated data centers
US11188408B2 (en) 2018-09-26 2021-11-30 International Business Machines Corporation Preemptive resource replacement according to failure pattern analysis in disaggregated data centers
US10838803B2 (en) 2018-09-26 2020-11-17 International Business Machines Corporation Resource provisioning and replacement according to a resource failure analysis in disaggregated data centers
US10761915B2 (en) 2018-09-26 2020-09-01 International Business Machines Corporation Preemptive deep diagnostics and health checking of resources in disaggregated data centers
US11188429B2 (en) 2018-11-29 2021-11-30 International Business Machines Corporation Building a highly-resilient system with failure independence in a disaggregated compute environment
US11733902B2 (en) 2021-04-30 2023-08-22 International Business Machines Corporation Integrating and increasing performance of disaggregated memory in operating systems
US11902282B2 (en) 2021-05-28 2024-02-13 Capital One Services, Llc Validating compliance of roles with access permissions
US11562082B2 (en) * 2021-05-28 2023-01-24 Capital One Services, Llc Crafting effective policies for identity and access management roles

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6081879A (en) * 1997-11-04 2000-06-27 Adaptec, Inc. Data processing system and virtual partitioning method for creating logical multi-level units of online storage

Family Cites Families (279)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US4422171A (en) 1980-12-29 1983-12-20 Allied Corporation, Law Department Method and system for data communication
JPS6024647A (en) * 1983-07-20 1985-02-07 Hitachi Ltd Autonomous resource managing system of system
JPS6133054A (en) 1984-07-25 1986-02-15 Nec Corp Transmission and reception system of packet signal
JPS62233951A (en) 1986-04-02 1987-10-14 Nec Corp Multiplex packet transmission system
JPS6390942A (en) 1986-10-06 1988-04-21 Nec Corp Interexchange data link control system
US5129088A (en) * 1987-11-30 1992-07-07 International Business Machines Corporation Data processing method to create virtual disks from non-contiguous groups of logically contiguous addressable blocks of direct access storage device
JPH0792775B2 (en) * 1989-12-11 1995-10-09 株式会社日立製作所 Space management method for external storage devices
JP2911519B2 (en) 1990-02-06 1999-06-23 キヤノン株式会社 Photoelectric conversion device
US5544347A (en) 1990-09-24 1996-08-06 Emc Corporation Data storage system controlled remote data mirroring with respectively maintained data indices
AU8683991A (en) 1990-11-09 1992-05-14 Array Technology Corporation Logical partitioning of a redundant array storage system
DE69129298T2 (en) * 1991-02-05 1998-11-19 Ibm GUIDE CONTROL FOR TRANSACTION COMMANDS
EP0551009B1 (en) * 1992-01-08 2001-06-13 Emc Corporation Method for synchronizing reserved areas in a redundant storage array
JP2868141B2 (en) * 1992-03-16 1999-03-10 株式会社日立製作所 Disk array device
JPH05347623A (en) 1992-04-07 1993-12-27 Nec Corp Multi-cast communication system
US5457683A (en) * 1993-05-07 1995-10-10 Apple Computer, Inc. Link and discovery protocols for a ring interconnect architecture
US5444709A (en) 1993-09-30 1995-08-22 Apple Computer, Inc. Protocol for transporting real time data
US5771354A (en) 1993-11-04 1998-06-23 Crawford; Christopher M. Internet online backup system provides remote storage for customers using IDs and passwords which were interactively established when signing up for backup services
WO1995013583A1 (en) * 1993-11-09 1995-05-18 Conner Kenneth H First come memory accessing without conflict
JP3249868B2 (en) 1993-11-19 2002-01-21 株式会社日立製作所 Array type storage system
US5506969A (en) * 1993-11-29 1996-04-09 Sun Microsystems, Inc. Method and apparatus for bus bandwidth management
CA2138696C (en) 1993-12-28 2001-02-27 Kenji Kataoka Ic card radio modem and communication system using the same
JPH07325779A (en) 1994-06-01 1995-12-12 Fuji Xerox Co Ltd Input/output controller
US5617449A (en) 1994-06-27 1997-04-01 Nec Corporation IC memory card type radio modem
US5623605A (en) 1994-08-29 1997-04-22 Lucent Technologies Inc. Methods and systems for interprocess communication and inter-network data transfer
US5615352A (en) 1994-10-05 1997-03-25 Hewlett-Packard Company Methods for adding storage disks to a hierarchic disk array while maintaining data availability
JPH08251198A (en) 1995-03-15 1996-09-27 Matsushita Electric Ind Co Ltd Multiplex transmission device
US6396480B1 (en) 1995-07-17 2002-05-28 Gateway, Inc. Context sensitive remote control groups
US5930786A (en) 1995-10-20 1999-07-27 Ncr Corporation Method and apparatus for providing shared data to a requesting client
US5680461A (en) 1995-10-26 1997-10-21 Sun Microsystems, Inc. Secure network protocol system and method
US5948062A (en) 1995-10-27 1999-09-07 Emc Corporation Network file server using a cached disk array storing a network file directory including file locking information and data mover computers each having file system software for shared read-write file access
US5758188A (en) * 1995-11-21 1998-05-26 Quantum Corporation Synchronous DMA burst transfer protocol having the peripheral device toggle the strobe signal such that data is latched using both edges of the strobe signal
JPH09149060A (en) 1995-11-27 1997-06-06 Mitsubishi Electric Corp Multiple address communication system in multi-distribution connection network
US5758050A (en) * 1996-03-12 1998-05-26 International Business Machines Corporation Reconfigurable data storage system
US5742604A (en) * 1996-03-28 1998-04-21 Cisco Systems, Inc. Interswitch link mechanism for connecting high-performance network switches
US5901327A (en) 1996-05-28 1999-05-04 Emc Corporation Bundling of write data from channel commands in a command chain for transmission over a data link between data storage systems for remote data mirroring
US6886035B2 (en) 1996-08-02 2005-04-26 Hewlett-Packard Development Company, L.P. Dynamic load balancing of a network of client and server computer
US5949977A (en) 1996-10-08 1999-09-07 Aubeta Technology, Llc Method and apparatus for requesting and processing services from a plurality of nodes connected via common communication links
US6202060B1 (en) * 1996-10-29 2001-03-13 Bao Q. Tran Data management system
US6157935A (en) 1996-12-17 2000-12-05 Tran; Bao Q. Remote data access and management system
US5991891A (en) * 1996-12-23 1999-11-23 Lsi Logic Corporation Method and apparatus for providing loop coherency
US6710786B1 (en) 1997-02-03 2004-03-23 Oracle International Corporation Method and apparatus for incorporating state information into a URL
WO1998036585A2 (en) 1997-02-18 1998-08-20 Northern Telecom Inc. Sponsored call and cell service
US7389312B2 (en) 1997-04-28 2008-06-17 Emc Corporation Mirroring network data to establish virtual storage area network
US5884038A (en) 1997-05-02 1999-03-16 Whowhere? Inc. Method for providing an Internet protocol address with a domain name server
JP3228182B2 (en) 1997-05-29 2001-11-12 株式会社日立製作所 Storage system and method for accessing storage system
JP2000513916A (en) * 1997-06-25 2000-10-17 サムソン エレクトロニクス カンパニー リミテッド Method and apparatus for home network automatic tree generator
US6295584B1 (en) * 1997-08-29 2001-09-25 International Business Machines Corporation Multiprocessor computer system with memory map translation
US6385638B1 (en) 1997-09-04 2002-05-07 Equator Technologies, Inc. Processor resource distributor and method
US7237036B2 (en) * 1997-10-14 2007-06-26 Alacritech, Inc. Fast-path apparatus for receiving data corresponding a TCP connection
US6434620B1 (en) * 1998-08-27 2002-08-13 Alacritech, Inc. TCP/IP offload network interface device
JPH11122301A (en) 1997-10-20 1999-04-30 Fujitsu Ltd Address conversion connection device
US6101559A (en) 1997-10-22 2000-08-08 Compaq Computer Corporation System for identifying the physical location of one or more peripheral devices by selecting icons on a display representing the one or more peripheral devices
US5937169A (en) * 1997-10-29 1999-08-10 3Com Corporation Offload of TCP segmentation to a smart adapter
US5983024A (en) 1997-11-26 1999-11-09 Honeywell, Inc. Method and apparatus for robust data broadcast on a peripheral component interconnect bus
US6018779A (en) * 1997-12-15 2000-01-25 Emc Corporation System for encapsulating a plurality of selected commands within a single command and transmitting the single command to a remote device over a communication link therewith
US6029168A (en) 1998-01-23 2000-02-22 Tricord Systems, Inc. Decentralized file mapping in a striped network file system in a distributed computing environment
US6105122A (en) 1998-02-06 2000-08-15 Ncr Corporation I/O protocol for highly configurable multi-node processing system
US6253273B1 (en) * 1998-02-06 2001-06-26 Emc Corporation Lock mechanism
US6246683B1 (en) * 1998-05-01 2001-06-12 3Com Corporation Receive processing with network protocol bypass
US6931430B1 (en) * 1998-05-13 2005-08-16 Thomas W. Lynch Maintaining coherency in a symbiotic computing system and method of operation thereof
KR100272165B1 (en) * 1998-05-20 2000-11-15 윤종용 Cache memory system and method thereof
US6259448B1 (en) 1998-06-03 2001-07-10 International Business Machines Corporation Resource model configuration and deployment in a distributed computer network
US6330236B1 (en) 1998-06-11 2001-12-11 Synchrodyne Networks, Inc. Packet switching method with time-based routing
US6377990B1 (en) 1998-06-15 2002-04-23 Lodgenet Entertainment Corporation System for providing internet access from locations different from those for which the user's software was configured
US6157955A (en) * 1998-06-15 2000-12-05 Intel Corporation Packet processing system including a policy engine having a classification unit
US6260120B1 (en) * 1998-06-29 2001-07-10 Emc Corporation Storage mapping and partitioning among multiple host processors in the presence of login state changes and host controller replacement
US6542909B1 (en) * 1998-06-30 2003-04-01 Emc Corporation System for determining mapping of logical objects in a computer system
US6449607B1 (en) 1998-09-11 2002-09-10 Hitachi, Ltd. Disk storage with modifiable data management function
US6330616B1 (en) 1998-09-14 2001-12-11 International Business Machines Corporation System for communications of multiple partitions employing host-network interface, and address resolution protocol for constructing data frame format according to client format
US6330615B1 (en) 1998-09-14 2001-12-11 International Business Machines Corporation Method of using address resolution protocol for constructing data frame formats for multiple partitions host network interface communications
DE69836326T2 (en) * 1998-09-18 2007-06-14 Alcatel Canada Inc., Kanata Method for cellular data transmission, cell, communication system and corresponding network terminal
JP4412685B2 (en) * 1998-09-28 2010-02-10 株式会社日立製作所 Storage controller and method of handling data storage system using the same
US6473774B1 (en) 1998-09-28 2002-10-29 Compaq Computer Corporation Method and apparatus for record addressing in partitioned files
US6618743B1 (en) 1998-10-09 2003-09-09 Oneworld Internetworking, Inc. Method and system for providing discrete user cells in a UNIX-based environment
US6654891B1 (en) 1998-10-29 2003-11-25 Nortel Networks Limited Trusted network binding using LDAP (lightweight directory access protocol)
US6502135B1 (en) * 1998-10-30 2002-12-31 Science Applications International Corporation Agile network protocol for secure communications with assured system availability
US6571274B1 (en) 1998-11-05 2003-05-27 Beas Systems, Inc. Clustered enterprise Java™ in a secure distributed processing system
US6597680B1 (en) 1998-11-16 2003-07-22 Telefonaktiebolaget Lm Ericsson (Publ) Packet traffic channel reassignment
US6360270B1 (en) * 1998-11-16 2002-03-19 Hewlett-Packard Company Hybrid and predictive admission control strategies for a server
FR2786892B3 (en) 1998-12-07 2000-12-29 Schneider Automation PROGRAMMABLE PLC COUPLER
US6728210B1 (en) 1998-12-21 2004-04-27 Nec America, Inc. Multi-logical access for a serial data link
US6434147B1 (en) 1999-01-08 2002-08-13 Nortel Netwoks Limited Method and system for sequential ordering of missing sequence numbers in SREJ frames in a telecommunication system
US6587464B1 (en) 1999-01-08 2003-07-01 Nortel Networks Limited Method and system for partial reporting of missing information frames in a telecommunication system
US6466571B1 (en) 1999-01-19 2002-10-15 3Com Corporation Radius-based mobile internet protocol (IP) address-to-mobile identification number mapping for wireless communication
US6470342B1 (en) 1999-03-12 2002-10-22 Compaq Computer Corporation Process of maintaining a distributed map of transaction identifiers and using hashing to access these maps
JP2000267979A (en) 1999-03-12 2000-09-29 Nec Corp Storage system
US6401183B1 (en) 1999-04-01 2002-06-04 Flash Vos, Inc. System and method for operating system independent storage management
US6701432B1 (en) * 1999-04-01 2004-03-02 Netscreen Technologies, Inc. Firewall including local bus
US6356929B1 (en) 1999-04-07 2002-03-12 International Business Machines Corporation Computer system and method for sharing a job with other computers on a computer network using IP multicast
US6553028B1 (en) 1999-04-30 2003-04-22 Cisco Technology, Inc. Method and apparatus for multicast switching using a centralized switching engine
US6487555B1 (en) 1999-05-07 2002-11-26 Alta Vista Company Method and apparatus for finding mirrored hosts by analyzing connectivity and IP addresses
US6275898B1 (en) 1999-05-13 2001-08-14 Lsi Logic Corporation Methods and structure for RAID level migration within a logical unit
JP3685651B2 (en) 1999-06-04 2005-08-24 沖電気工業株式会社 Interconnect apparatus and active QoS mapping method
US6892230B1 (en) 1999-06-11 2005-05-10 Microsoft Corporation Dynamic self-configuration for ad hoc peer networking using mark-up language formated description messages
US6910068B2 (en) 1999-06-11 2005-06-21 Microsoft Corporation XML-based template language for devices and services
WO2000079362A2 (en) 1999-06-23 2000-12-28 Mangosoft Corporation Distributed virtual web cache implemented entirely in software
US6668378B2 (en) 1999-06-30 2003-12-23 Webtv Networks, Inc. Interactive television triggers having connected content/disconnected content attribute
US7206805B1 (en) * 1999-09-09 2007-04-17 Oracle International Corporation Asynchronous transcription object management system
JP4331835B2 (en) 1999-09-22 2009-09-16 パナソニック株式会社 Image data transmission method
US6882637B1 (en) 1999-10-14 2005-04-19 Nokia Networks Oy Method and system for transmitting and receiving packets
US7333451B1 (en) 1999-10-18 2008-02-19 Nortel Networks Limited Buffer management for mobile internet protocol
US6732230B1 (en) 1999-10-20 2004-05-04 Lsi Logic Corporation Method of automatically migrating information from a source to an assemblage of structured data carriers and associated system and assemblage of data carriers
US6711164B1 (en) 1999-11-05 2004-03-23 Nokia Corporation Method and apparatus for performing IP-ID regeneration to improve header compression efficiency
US6535925B1 (en) 1999-11-09 2003-03-18 Telefonaktiebolaget L M Ericsson (Publ) Packet header compression using division remainders
US6678241B1 (en) 1999-11-30 2004-01-13 Cisc Technology, Inc. Fast convergence with topology switching
JP2001160828A (en) 1999-12-03 2001-06-12 Matsushita Electric Ind Co Ltd Vpn communication method in security gateway device
US6389448B1 (en) 1999-12-06 2002-05-14 Warp Solutions, Inc. System and method for load balancing
JP3959583B2 (en) 1999-12-10 2007-08-15 ソニー株式会社 Recording system
JP2001166993A (en) 1999-12-13 2001-06-22 Hitachi Ltd Memory control unit and method for controlling cache memory
US6742034B1 (en) 1999-12-16 2004-05-25 Dell Products L.P. Method for storage device masking in a storage area network and storage controller and storage subsystem for using such a method
US6701431B2 (en) 2000-01-28 2004-03-02 Infineon Technologies Ag Method of generating a configuration for a configurable spread spectrum communication device
FR2804816B1 (en) * 2000-02-03 2003-10-31 Gemplus Card Int TRANSPORT OF PROTOCOL UNITS OF PORTABLE ELECTRONIC OBJECT BY PROTOCOL FOR MICROCOMPUTER DEVICES
US6834326B1 (en) * 2000-02-04 2004-12-21 3Com Corporation RAID method and device with network protocol between controller and storage devices
US20010034758A1 (en) 2000-02-24 2001-10-25 Dan Kikinis Virtual private network (VPN) for servicing home gateway system through external disk management
US7324228B2 (en) 2000-02-25 2008-01-29 Hewlett-Packard Development Company, L.P. System and method for downloading and for printing data from an external content source
US7225243B1 (en) 2000-03-14 2007-05-29 Adaptec, Inc. Device discovery methods and systems implementing the same
US6826613B1 (en) 2000-03-15 2004-11-30 3Com Corporation Virtually addressing storage devices through a switch
US6601101B1 (en) * 2000-03-15 2003-07-29 3Com Corporation Transparent access to network attached devices
US6975629B2 (en) 2000-03-22 2005-12-13 Texas Instruments Incorporated Processing packets based on deadline intervals
US6947430B2 (en) * 2000-03-24 2005-09-20 International Business Machines Corporation Network adapter with embedded deep packet processing
US6882648B2 (en) 2000-03-29 2005-04-19 Fujitsu Limited Communication device
US6629264B1 (en) * 2000-03-30 2003-09-30 Hewlett-Packard Development Company, L.P. Controller-based remote copy system with logical unit grouping
US6993587B1 (en) 2000-04-07 2006-01-31 Network Appliance Inc. Method and apparatus for election of group leaders in a distributed network
US7463582B2 (en) 2000-04-14 2008-12-09 Hughes Network Systems, Llc System and method for scaling a two-way satellite system
US20030041138A1 (en) 2000-05-02 2003-02-27 Sun Microsystems, Inc. Cluster membership monitor
US6826561B2 (en) * 2000-05-22 2004-11-30 Broadcom Corporation Method and apparatus for performing a binary search on an expanded tree
JP4168574B2 (en) * 2000-06-02 2008-10-22 株式会社日立製作所 Packet transfer apparatus, packet transfer control method, and packet transfer apparatus setting method
EP1178407A3 (en) 2000-06-02 2007-12-12 Compaq Computer Corporation Architecture for parallel distributed table driven I/O mapping
US7051087B1 (en) 2000-06-05 2006-05-23 Microsoft Corporation System and method for automatic detection and configuration of network parameters
US6629162B1 (en) 2000-06-08 2003-09-30 International Business Machines Corporation System, method, and product in a logically partitioned system for prohibiting I/O adapters from accessing memory assigned to other partitions during DMA
US6681244B1 (en) * 2000-06-09 2004-01-20 3Com Corporation System and method for operating a network adapter when an associated network computing system is in a low-power state
JP3890856B2 (en) 2000-06-12 2007-03-07 ヤマハ株式会社 Wireless audio equipment
US6894976B1 (en) 2000-06-15 2005-05-17 Network Appliance, Inc. Prevention and detection of IP identification wraparound errors
US6629178B1 (en) 2000-06-15 2003-09-30 Advanced Micro Devices, Inc. System and method for controlling bus access for bus agents having varying priorities
US7966421B2 (en) * 2000-06-21 2011-06-21 SAtech Group, A.B. Limited Liability Company Method and apparatus for logically expanding the length of a search key
WO2002009458A2 (en) 2000-07-24 2002-01-31 Bluesocket, Inc. Method and system for enabling seamless roaming in a wireless network
US6754662B1 (en) * 2000-08-01 2004-06-22 Nortel Networks Limited Method and apparatus for fast and consistent packet classification via efficient hash-caching
WO2002015018A1 (en) 2000-08-11 2002-02-21 3Ware, Inc. Architecture for providing block-level storage access over a computer network
US7278142B2 (en) 2000-08-24 2007-10-02 Veritas Operating Corporation Dynamic computing environment using remotely allocable resources
JP3555568B2 (en) 2000-09-04 2004-08-18 日本電気株式会社 IP telephone recording system
US7389358B1 (en) 2000-09-13 2008-06-17 Fortinet, Inc. Distributed virtual system to support managed, network-based services
US6977927B1 (en) 2000-09-18 2005-12-20 Hewlett-Packard Development Company, L.P. Method and system of allocating storage resources in a storage area network
US6928473B1 (en) 2000-09-26 2005-08-09 Microsoft Corporation Measuring network jitter on application packet flows
US6854021B1 (en) 2000-10-02 2005-02-08 International Business Machines Corporation Communications between partitions within a logically partitioned computer
US6853382B1 (en) 2000-10-13 2005-02-08 Nvidia Corporation Controller for a memory system having multiple partitions
JP3522250B2 (en) 2000-10-27 2004-04-26 株式会社ソニー・コンピュータエンタテインメント Partition creation method and deletion method, recording medium recording program, and information processing apparatus
US6862648B2 (en) 2000-10-30 2005-03-01 Sun Microsystems, Inc. Interface emulation for storage devices
US6978271B1 (en) 2000-10-31 2005-12-20 Unisys Corporation Mechanism for continuable calls to partially traverse a dynamic general tree
US6985956B2 (en) 2000-11-02 2006-01-10 Sun Microsystems, Inc. Switching system
US6434683B1 (en) 2000-11-07 2002-08-13 Storage Technology Corporation Method and system for transferring delta difference data to a storage device
ITBO20000653A1 (en) 2000-11-14 2002-05-14 Sichera Di Sichera Gianni METHOD FOR INSERTING A COVER SHEET BETWEEN A CLEAR FILM AND A CONTAINER WITH BOOK OPENING AND DEVICE THAT IMPLEMENTS
US6601135B1 (en) 2000-11-16 2003-07-29 International Business Machines Corporation No-integrity logical volume management method and system
US7406523B1 (en) 2000-11-21 2008-07-29 Microsoft Corporation Client-server communications system and method using a semi-connectionless protocol
US7065568B2 (en) * 2000-11-30 2006-06-20 Microsoft Corporation System and method for managing states and user context over stateless protocols
US6757845B2 (en) * 2000-11-30 2004-06-29 Bitmicro Networks, Inc. Method and apparatus for testing a storage device
EP1598714B1 (en) 2000-12-13 2016-09-28 LG Electronics Inc. Apparatus and method for remotely controlling household appliances
US6876657B1 (en) * 2000-12-14 2005-04-05 Chiaro Networks, Ltd. System and method for router packet control and ordering
US7234029B2 (en) * 2000-12-28 2007-06-19 Intel Corporation Method and apparatus for reducing memory latency in a cache coherent multi-node architecture
US7200641B1 (en) 2000-12-29 2007-04-03 Emc Corporation Method and system for encoding SCSI requests for transmission using TCP/IP
JP3392828B2 (en) 2001-01-10 2003-03-31 株式会社東芝 Distributed processing system, drawing command transfer method in the system, and computer-readable storage medium
WO2002057917A2 (en) 2001-01-22 2002-07-25 Sun Microsystems, Inc. Peer-to-peer network computing platform
US7260633B2 (en) * 2001-01-25 2007-08-21 Microsoft Corporation System and method for processing requests from newly registered remote application consumers
WO2002061599A1 (en) 2001-01-25 2002-08-08 Crescent Networks, Inc. Extension of address resolution protocol (arp) for internet protocol (ip) virtual networks
US7203730B1 (en) 2001-02-13 2007-04-10 Network Appliance, Inc. Method and apparatus for identifying storage devices
WO2002065329A1 (en) 2001-02-14 2002-08-22 The Escher Group, Ltd. Peer-to peer enterprise storage
JP2002252880A (en) 2001-02-26 2002-09-06 Sanyo Electric Co Ltd Liquid crystal projector
WO2002071775A1 (en) 2001-03-01 2002-09-12 Mitsubishi Denki Kabushiki Kaisha Mobile ip packet communication system
US7145866B1 (en) 2001-03-01 2006-12-05 Emc Corporation Virtual network devices
JP3470756B2 (en) 2001-03-06 2003-11-25 日本電気株式会社 Main device address restriction notification system
US20020133539A1 (en) 2001-03-14 2002-09-19 Imation Corp. Dynamic logical storage volumes
US7072823B2 (en) 2001-03-30 2006-07-04 Intransa, Inc. Method and apparatus for accessing memory using Ethernet packets
US6983326B1 (en) 2001-04-06 2006-01-03 Networks Associates Technology, Inc. System and method for distributed function discovery in a peer-to-peer network environment
JP3972596B2 (en) 2001-04-20 2007-09-05 株式会社日立製作所 Disk array system
US20020165978A1 (en) 2001-05-07 2002-11-07 Terence Chui Multi-service optical infiniband router
US20020184327A1 (en) * 2001-05-11 2002-12-05 Major Robert Drew System and method for partitioning address space in a proxy cache server cluster
US7145919B2 (en) 2001-06-01 2006-12-05 Telefonaktienbolaget Lm Ericsson (Publ) Method and apparatus for transporting different classes of data bits in a payload over a radio interface
US20030026246A1 (en) * 2001-06-06 2003-02-06 Zarlink Semiconductor V.N. Inc. Cached IP routing tree for longest prefix search
US7613806B2 (en) 2001-06-28 2009-11-03 Emc Corporation System and method for managing replication sets of data distributed over one or more computer systems
US6636958B2 (en) 2001-07-17 2003-10-21 International Business Machines Corporation Appliance server with a drive partitioning scheme that accommodates application growth in size
KR100392382B1 (en) 2001-07-27 2003-07-23 한국전자통신연구원 Method of The Logical Volume Manager supporting Dynamic Online resizing and Software RAID
US7363310B2 (en) * 2001-09-04 2008-04-22 Timebase Pty Limited Mapping of data from XML to SQL
US20030079018A1 (en) 2001-09-28 2003-04-24 Lolayekar Santosh C. Load balancing in a storage network
US7404000B2 (en) 2001-09-28 2008-07-22 Emc Corporation Protocol translation in a storage system
US7185062B2 (en) 2001-09-28 2007-02-27 Emc Corporation Switch-based storage services
US7558264B1 (en) 2001-09-28 2009-07-07 Emc Corporation Packet classification in a storage system
US7437493B2 (en) 2001-09-28 2008-10-14 Dot Hill Systems Corp. Modular architecture for a network storage controller
US7707304B1 (en) 2001-09-28 2010-04-27 Emc Corporation Storage switch for storage area network
US20030069995A1 (en) * 2001-10-05 2003-04-10 Fayette Brad K. Method and system for communicating among heterogeneous systems
JP2003141054A (en) 2001-11-07 2003-05-16 Hitachi Ltd Storage management computer
CN100518108C (en) 2001-11-12 2009-07-22 艾利森电话股份有限公司 Method for providing quality of service in ieee 802.11 systems
US7366784B2 (en) 2001-11-27 2008-04-29 Hitachi, Ltd. System and method for providing and using a VLAN-aware storage device
US6775672B2 (en) 2001-12-19 2004-08-10 Hewlett-Packard Development Company, L.P. Updating references to a migrated object in a partition-based distributed file system
US6772161B2 (en) 2001-12-19 2004-08-03 Hewlett-Packard Development Company, L.P. Object-level migration in a partition-based distributed file system
US6775673B2 (en) 2001-12-19 2004-08-10 Hewlett-Packard Development Company, L.P. Logical volume-level migration in a partition-based distributed file system
US7599360B2 (en) 2001-12-26 2009-10-06 Cisco Technology, Inc. Methods and apparatus for encapsulating a frame for transmission in a storage area network
US7433948B2 (en) 2002-01-23 2008-10-07 Cisco Technology, Inc. Methods and apparatus for implementing virtualization of storage within a storage area network
US7483403B2 (en) 2002-01-10 2009-01-27 Robert Bosch Gmbh Protocol for reliable, self-organizing, low-power wireless network for security and building automation systems
US6934799B2 (en) 2002-01-18 2005-08-23 International Business Machines Corporation Virtualization of iSCSI storage
EP1329809B1 (en) 2002-01-18 2008-08-06 Hewlett-Packard Company, A Delaware Corporation Distributed computing system and method
US7912059B1 (en) 2002-02-08 2011-03-22 Hatteras Networks Methods, aggregation devices, and computer program products for distinguishing between sub-networks coupled to aggregation device ports by using an independent sub-network identifier domain space for each port
US7283527B2 (en) 2002-02-27 2007-10-16 International Business Machines Corporation Apparatus and method of maintaining two-byte IP identification fields in IP headers
US7535913B2 (en) 2002-03-06 2009-05-19 Nvidia Corporation Gigabit ethernet adapter supporting the iSCSI and IPSEC protocols
US7539991B2 (en) 2002-03-21 2009-05-26 Netapp, Inc. Method and apparatus for decomposing I/O tasks in a raid system
US7149769B2 (en) 2002-03-26 2006-12-12 Hewlett-Packard Development Company, L.P. System and method for multi-destination merge in a storage area network
US6683883B1 (en) 2002-04-09 2004-01-27 Sancastle Technologies Ltd. ISCSI-FCP gateway
US6912622B2 (en) 2002-04-15 2005-06-28 Microsoft Corporation Multi-level cache architecture and cache management method for peer-to-peer name resolution protocol
US7188194B1 (en) 2002-04-22 2007-03-06 Cisco Technology, Inc. Session-based target/LUN mapping for a storage area network and associated method
US6895461B1 (en) * 2002-04-22 2005-05-17 Cisco Technology, Inc. Method and apparatus for accessing remote storage using SCSI and an IP network
US7146427B2 (en) 2002-04-23 2006-12-05 Lsi Logic Corporation Polling-based mechanism for improved RPC timeout handling
US20030202510A1 (en) * 2002-04-26 2003-10-30 Maxxan Systems, Inc. System and method for scalable switch fabric for computer network
US20030204611A1 (en) 2002-04-29 2003-10-30 Mccosh John C. Communications tester and method of using same
US6732171B2 (en) 2002-05-31 2004-05-04 Lefthand Networks, Inc. Distributed network storage system with virtualization
JP2004013215A (en) 2002-06-03 2004-01-15 Hitachi Ltd Storage system, storage sub-system, and information processing system including them
US7421736B2 (en) 2002-07-02 2008-09-02 Lucent Technologies Inc. Method and apparatus for enabling peer-to-peer virtual private network (P2P-VPN) services in VPN-enabled network
US7111303B2 (en) 2002-07-16 2006-09-19 International Business Machines Corporation Virtual machine operating system LAN
JP4151330B2 (en) 2002-07-19 2008-09-17 日本電気株式会社 I / O control method for network file system
JP3869769B2 (en) 2002-07-24 2007-01-17 株式会社日立製作所 Switching node device for storage network and access method of remote storage device
US7263108B2 (en) 2002-08-06 2007-08-28 Netxen, Inc. Dual-mode network storage systems and methods
US6741554B2 (en) 2002-08-16 2004-05-25 Motorola Inc. Method and apparatus for reliably communicating information packets in a wireless communication network
US20040047367A1 (en) * 2002-09-05 2004-03-11 Litchfield Communications, Inc. Method and system for optimizing the size of a variable buffer
GB0221464D0 (en) 2002-09-16 2002-10-23 Cambridge Internetworking Ltd Network interface and protocol
US7475124B2 (en) 2002-09-25 2009-01-06 Emc Corporation Network block services for client access of network-attached data storage in an IP network
US7243144B2 (en) 2002-09-26 2007-07-10 Hitachi, Ltd. Integrated topology management method for storage and IP networks
JP4202709B2 (en) 2002-10-07 2008-12-24 株式会社日立製作所 Volume and failure management method in a network having a storage device
US7152069B1 (en) * 2002-10-15 2006-12-19 Network Appliance, Inc. Zero copy writes through use of mbufs
US7509645B2 (en) 2002-10-17 2009-03-24 Intel Corporation Methods and apparatus for load balancing storage nodes in a distributed network attached storage system
US7752294B2 (en) 2002-10-28 2010-07-06 Netapp, Inc. Method and system for dynamic expansion and contraction of nodes in a storage area network
US7616638B2 (en) 2003-07-29 2009-11-10 Orbital Data Corporation Wavefront detection and disambiguation of acknowledgments
US7120666B2 (en) 2002-10-30 2006-10-10 Riverbed Technology, Inc. Transaction accelerator for client-server communication systems
US7047254B2 (en) 2002-10-31 2006-05-16 Hewlett-Packard Development Company, L.P. Method and apparatus for providing aggregate object identifiers
US8005918B2 (en) 2002-11-12 2011-08-23 Rateze Remote Mgmt. L.L.C. Data storage devices having IP capable partitions
US7170890B2 (en) 2002-12-16 2007-01-30 Zetera Corporation Electrical devices with improved communication
US7742473B2 (en) 2002-11-12 2010-06-22 Mark Adams Accelerator module
CN100380878C (en) 2002-11-12 2008-04-09 泽特拉公司 Communication protocols, systems and methods
US7649880B2 (en) 2002-11-12 2010-01-19 Mark Adams Systems and methods for deriving storage area commands
US7327735B2 (en) 2002-11-27 2008-02-05 Alcatel Canada Inc. System and method for detecting lost messages transmitted between modules in a communication device
US20040160975A1 (en) 2003-01-21 2004-08-19 Charles Frank Multicast communication protocols, systems and methods
US7890633B2 (en) 2003-02-13 2011-02-15 Oracle America, Inc. System and method of extending virtual address resolution for mapping networks
US20040181476A1 (en) * 2003-03-13 2004-09-16 Smith William R. Dynamic network resource brokering
US20040184455A1 (en) 2003-03-19 2004-09-23 Institute For Information Industry System and method used by a gateway for processing fragmented IP packets from a private network
US7181521B2 (en) 2003-03-21 2007-02-20 Intel Corporation Method and system for selecting a local registry master from among networked mobile devices based at least in part on abilities of the mobile devices
US6904470B1 (en) 2003-03-26 2005-06-07 Emc Corporation Device selection by a disk adapter scheduler
JP4123088B2 (en) 2003-08-06 2008-07-23 株式会社日立製作所 Storage network management apparatus and method
JPWO2005017738A1 (en) 2003-08-13 2006-10-12 富士通株式会社 Print control method, print control apparatus, and print control program
US7526577B2 (en) 2003-09-19 2009-04-28 Microsoft Corporation Multiple offload of network state objects with support for failover events
US7415018B2 (en) 2003-09-17 2008-08-19 Alcatel Lucent IP Time to Live (TTL) field used as a covert channel
GB2406742B (en) 2003-10-03 2006-03-22 3Com Corp Switching fabrics and control protocols for them
US7436789B2 (en) 2003-10-09 2008-10-14 Sarnoff Corporation Ad Hoc wireless node and network
JP4397675B2 (en) 2003-11-12 2010-01-13 株式会社日立製作所 Computer system
US7333994B2 (en) 2003-12-18 2008-02-19 Microsoft Corporation System and method for database having relational node structure
US20050166022A1 (en) * 2004-01-28 2005-07-28 Hitachi, Ltd. Method and apparatus for copying and backup in storage systems
KR100600862B1 (en) * 2004-01-30 2006-07-14 김선권 Method of collecting and searching for access route of infomation resource on internet and Computer readable medium stored thereon program for implementing the same
US8990430B2 (en) 2004-02-19 2015-03-24 Cisco Technology, Inc. Interface bundles in virtual network devices
US7447209B2 (en) 2004-03-09 2008-11-04 The University Of North Carolina Methods, systems, and computer program products for modeling and simulating application-level traffic characteristics in a network based on transport and network layer header information
JP2005265914A (en) 2004-03-16 2005-09-29 Ricoh Co Ltd Zoom lens, camera and personal digital assistance
US7409494B2 (en) 2004-04-30 2008-08-05 Network Appliance, Inc. Extension of write anywhere file system layout
US7706364B2 (en) 2004-05-19 2010-04-27 Cisco Technology, Inc. Virtual network device clusters
US7886299B2 (en) * 2004-06-01 2011-02-08 Hitachi, Ltd. Method of dynamically balancing workload of a storage system
US20050270856A1 (en) 2004-06-03 2005-12-08 Inphase Technologies, Inc. Multi-level format for information storage
US8155117B2 (en) 2004-06-29 2012-04-10 Qualcomm Incorporated Filtering and routing of fragmented datagrams in a data network
US20060036602A1 (en) 2004-08-13 2006-02-16 Unangst Marc J Distributed object-based storage system that stores virtualization maps in object attributes
US7515589B2 (en) 2004-08-27 2009-04-07 International Business Machines Corporation Method and apparatus for providing network virtualization
US20060077902A1 (en) 2004-10-08 2006-04-13 Kannan Naresh K Methods and apparatus for non-intrusive measurement of delay variation of data traffic on communication networks
US7536525B2 (en) 2004-11-09 2009-05-19 Dell Products L.P. Virtual machine hot cloning including freezing and unfreezing memory in a distributed network
US7742454B2 (en) 2004-12-16 2010-06-22 International Business Machines Corporation Network performance by dynamically setting a reassembly timer based on network interface
US7564843B2 (en) 2004-12-16 2009-07-21 International Business Machines Corporation Method, system and article for improved network performance by avoiding IP-ID wrap-arounds causing data corruption on fast networks
US7467189B2 (en) * 2005-01-21 2008-12-16 Microsoft Corporation Resource identifier zone translation
ATE536685T1 (en) 2005-02-04 2011-12-15 Huawei Tech Co Ltd INFORMATION SENDING PROCEDURE
US7702850B2 (en) 2005-03-14 2010-04-20 Thomas Earl Ludwig Topology independent storage arrays and methods
EP1861949A2 (en) 2005-03-24 2007-12-05 Nokia Corporation Notification of a receiving device about a forthcoming transmission session
WO2006127013A1 (en) 2005-05-26 2006-11-30 Zetera Corporation Virtual devices and virtual bus tunnels, modules and methods
US7620981B2 (en) 2005-05-26 2009-11-17 Charles William Frank Virtual devices and virtual bus tunnels, modules and methods
US8819092B2 (en) 2005-08-16 2014-08-26 Rateze Remote Mgmt. L.L.C. Disaggregated resources and access methods
US7743214B2 (en) 2005-08-16 2010-06-22 Mark Adams Generating storage system commands
CN101371229B (en) 2005-10-06 2012-04-25 雷特泽远程管理有限公司 Resource command message and method thereof
US9270532B2 (en) 2005-10-06 2016-02-23 Rateze Remote Mgmt. L.L.C. Resource command messages and methods
US20070101023A1 (en) 2005-10-28 2007-05-03 Microsoft Corporation Multiple task offload to a peripheral device
US9049205B2 (en) 2005-12-22 2015-06-02 Genesys Telecommunications Laboratories, Inc. System and methods for locating and acquisitioning a service connection via request broadcasting over a data packet network
US7924881B2 (en) 2006-04-10 2011-04-12 Rateze Remote Mgmt. L.L.C. Datagram identifier management

Patent Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US6081879A (en) * 1997-11-04 2000-06-27 Adaptec, Inc. Data processing system and virtual partitioning method for creating logical multi-level units of online storage

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
LEE E K ET AL: "A Comparison of Two Distributed Disk Systems", DIGITAL SYSTEMS RESEARCH CENTER - RESEARCH REPORT SRC-155, 30 April 1998 (1998-04-30), XP002368118 *
LEE E K ET AL: "PETAL: DISTRIBUTED VIRTUAL DISKS", 7TH. INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATION SYSTEMS. CAMBRIDGE, MA., OCT. 1 - 5, 1996, INTERNATIONAL CONFERENCE ON ARCHITECTURAL SUPPORT FOR PROGRAMMING LANGUAGES AND OPERATION SYSTEMS (ASPLOS), NEW, vol. CONF. 7, 1 October 1996 (1996-10-01), pages 84 - 92, XP000681711, ISBN: 0-89791-767-7 *
SATRAN J ET AL: "Internet Small Computer Systems Interface (iSCSI)", IETF STANDARD, INTERNET ENGINEERING TASK FORCE, IETF, CH, April 2004 (2004-04-01), XP015009500, ISSN: 0000-0003 *

Cited By (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
USRE47411E1 (en) 2005-08-16 2019-05-28 Rateze Remote Mgmt. L.L.C. Disaggregated resources and access methods
USRE48894E1 (en) 2005-08-16 2022-01-11 Rateze Remote Mgmt. L.L.C. Disaggregated resources and access methods
US7636758B1 (en) 2008-07-10 2009-12-22 Gene Fein Advertisement forwarding storage and retrieval network

Also Published As

Publication number Publication date
US20070043771A1 (en) 2007-02-22
USRE48894E1 (en) 2022-01-11
US8819092B2 (en) 2014-08-26
USRE47411E1 (en) 2019-05-28

Similar Documents

Publication Publication Date Title
USRE48894E1 (en) Disaggregated resources and access methods
US7620981B2 (en) Virtual devices and virtual bus tunnels, modules and methods
US7827317B2 (en) Apparatus for management of mixed protocol storage networks
US7240098B1 (en) System, method, and software for a virtual host bus adapter in a storage-area network
US8838850B2 (en) Cluster control protocol
US8972547B2 (en) Method and apparatus for dynamically configuring virtual internet protocol addresses
US7519769B1 (en) Scalable storage network virtualization
US20030097607A1 (en) System and method for ensuring the availability of a storage system
US20040098458A1 (en) Distributed computing infrastructure including multiple collaborative sessions
US20070112931A1 (en) Scsi-based storage area network having a scsi router that routes traffic between scsi and ip networks
US20100080237A1 (en) Fibre channel proxy
EP1382176A2 (en) System and method for accessing a storage area network as network attached storage
US20140229695A1 (en) Systems and methods for backup in scale-out storage clusters
US8924513B2 (en) Storage system
US20170126479A1 (en) Implementing switchover operations between computing nodes
US20040139194A1 (en) System and method of measuring and monitoring network services availablility
US20100017456A1 (en) System and Method for an On-Demand Peer-to-Peer Storage Virtualization Infrastructure
US8819200B2 (en) Automated cluster node configuration
US9674312B2 (en) Dynamic protocol selection
US7231503B2 (en) Reconfiguring logical settings in a storage system
CN1592231A (en) Maintaining unit structure of high extending internet superserver and its method
JP4640335B2 (en) Data storage system
US10768834B2 (en) Methods for managing group objects with different service level objectives for an application and devices thereof
US7539711B1 (en) Streaming video data with fast-forward and no-fast-forward portions
Chavis et al. A Guide to the IBM Clustered Network File System

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application
NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 05807674

Country of ref document: EP

Kind code of ref document: A1