LSID

From HandWiki
Short description: Way to name and locate pieces of information on the World Wide Web

Life Science Identifiers[1][2] are a way to name and locate pieces of information on the web. Essentially, an LSID is a unique identifier for some data, and the LSID protocol specifies a standard way to locate the data (as well as a standard way of describing that data). They are a little like DOIs used by many publishers.

An LSID is represented as a uniform resource name (URN) with the following format:

  • urn:lsid:⟨Authority⟩:⟨Namespace⟩:⟨ObjectID⟩[:⟨Version⟩]

The lsid: namespace, however, is not registered with the Internet Assigned Numbers Authority (IANA), and so these are not strictly URNs or URIs.[3]

LSIDs may be resolved in URLs, e.g. http://zoobank.org/urn:lsid:zoobank.org:pub:CDC8D258-8F57-41DC-B560-247E17D3DC8C

Controversy over the use of LSIDs

There has been a lot of interest in LSIDs in both the bioinformatics and the biodiversity communities, with the latter continuing to use them as a way of identifying species in global catalogues.[4] However, more recently, as understanding has increased of how HTTP URIs can perform a similar naming task,[5][6] the use of LSIDs as identifiers has been criticized[7] as violating the Web Architecture good practice of reusing existing URI schemes.[8] Nevertheless, the explicit separation of data from metadata; specification of a method for discovering multiple locations for data-retrieval; and the ability to discover multiple independent sources of metadata for any identified thing were crucial parts of the LSID and its resolution specification that have not successfully been mimicked by an HTTP-only approach.

The World Wide Web provides a globally distributed communication framework that is essential for almost all scientific collaboration, including bioinformatics. However, several limits and inadequacies were thought to exist, one of which was the inability to programmatically identify locally named objects that may be widely distributed over the network. This perceived shortcoming would have limited our ability to integrate multiple knowledgebases, each of which gives partial information of a shared domain, as is commonly seen in bioinformatics. The Life Science Identifier (LSID) and LSID Resolution System (LSRS) were designed to provide simple and elegant solutions to this problem, consistent with next-generation Semantic Web and semantic grid, based on the extension of existing internet technologies. However, it has more recently been pointed out that some of these perceived shortcomings are not intrinsic to HTTP URIs, and much (though not all) of the functionality that LSIDs provide can be obtained using properly crafted HTTP URIs.[5]

Alternative identifiers for organisms

Alternative identifiers have been proposed for organisms, e.g. the DOI system. NamesforLife (N4L), a private company, set up a system to apply DOIs to organisms. For example, doi:10.1601/nm.3093 is the DOI for Escherichia coli, and doi:10.1601/tx.3093 is the corresponding taxon.[9]

See also

Notes

External links