DataSpace is a web-services-based infrastructure for exploring, analyzing, and mining remote and distributed data. This site describes DataSpace protocols, DataSpace applications, and open-source DataSpace servers and clients.
DataSpace applications work with remote and distributed data through the DataSpace Transfer Protocol (DSTP). DSTP simplifies working with data by providing direct support for common operations, such as working with attributes, keys, and metadata.
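To make the three kinds of operations concrete, here is a minimal local sketch in Python. It is purely illustrative: it does not show the DSTP wire format or any DataSpace client API, and every name in it (select_attributes, join_on_key, attribute_metadata, and the sample records) is hypothetical. It only assumes that a data set can be viewed as records with named attributes and that two data sets can be related through a shared key attribute.

```python
from typing import Dict, List, Sequence

# A record is modeled as a mapping from attribute name to value (an assumption
# made for this sketch, not a DSTP data model).
Record = Dict[str, object]


def select_attributes(records: List[Record], attributes: Sequence[str]) -> List[Record]:
    """Keep only the named attributes (columns) of each record."""
    return [{name: rec[name] for name in attributes} for rec in records]


def join_on_key(left: List[Record], right: List[Record], key: str) -> List[Record]:
    """Combine records from two data sets that share the same key value."""
    index = {rec[key]: rec for rec in right}  # look up right-hand records by key
    joined = []
    for rec in left:
        match = index.get(rec[key])
        if match is not None:
            joined.append({**rec, **match})
    return joined


def attribute_metadata(records: List[Record]) -> Dict[str, str]:
    """Describe each attribute by the type name of its value in the first record."""
    if not records:
        return {}
    return {name: type(value).__name__ for name, value in records[0].items()}


# Hypothetical sample data standing in for two remote data sets.
readings = [
    {"station_id": 1, "temp_c": 12.5, "humidity": 0.4},
    {"station_id": 2, "temp_c": 9.0, "humidity": 0.6},
]
stations = [
    {"station_id": 1, "city": "Chicago"},
    {"station_id": 3, "city": "Amsterdam"},
]

temps = select_attributes(readings, ["station_id", "temp_c"])
combined = join_on_key(temps, stations, "station_id")
```

In this sketch only station 1 appears in both data sets, so the join keeps a single combined record. A protocol that supports these operations directly lets a client request just the attributes and keys it needs rather than transferring whole files and filtering them locally.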
DSTP can be layered over specialized high-performance transport protocols such as SABUL. Using such protocols, DataSpace applications can work effectively over wide-area, high-performance OC-3, OC-12, and Gbps networks. SABUL currently holds the land speed record for connecting two distributed clusters, a record set at iGrid 02.
DataSpace is supported by grants from the NSF. DataSpace is built around standards developed by the Data Mining Group and W3C.