recentpopularlog in

tsuomela : data-management   16

Changing software, hardware a nightmare for tracking scientific data
We've gone into all the problems involved with preserving and sharing scientific data in some detail, but the challenges don't end there. Typically, data doesn't speak for itself; it has to be analyzed and interpreted. And, these days, that analysis generally involves computer tools. Even basic images of cells can end up being processed to look for things like signal intensity and total area of signal. The results of that analysis may end up plugged into a spreadsheet and subjected to a further analysis. This general approach—a pipeline of software tools—makes documenting and reproducing exactly what happened to generate a final result.
science  data  data-management  communication  technology  computer 
november 2010 by tsuomela
Human genome at ten: Science after the sequence : Nature News
Given ten years' of hindsight and the current set of obstacles, it's no surprise that researchers now state somewhat modest expectations for what human genomics can deliver and by when. The rationale for sequencing and exploring the human genome — to revolutionize the finding of new drugs, diagnostics and vaccines, and to tailor treatments to the genetic make-up of individuals — is the same today. But almost half of respondents now say that the benefits of the human genome were oversold in the lead up to 2000. "While I do feel that the gains made by the human genome project are extraordinary and affect my research significantly, I still feel that it was overhyped to the general population," read one typical response.
genetics  biology  technology  history  sts  data-management 
june 2010 by tsuomela
A Metadata Best Practice for a Scientific Data Repository - Journal of Library Metadata
Digital data repositories ought to support immediate operational needs and long-term project goals. This paper presents the Dryad repository's metadata best practice balancing of these two needs. The paper reviews background work exploring the meaning of science, characterizing data, and highlighting data curation metadata challenges. The Dryad repository is introduced, and the initiative's metadata best practice and underlying rationales are described. Dryad's metadata approach includes two prongs: one addressing the long-term goal to align with the Semantic Web via a metadata application profile; and another addressing the immediate need to make content available in DSpace via an extensible markup language (XML) schema. The conclusion summarizes limitations and advantages of the two prongs underlying Dryad's metadata effort.
data-curation  science  repository  data-management  metadata  best-practices 
april 2010 by tsuomela
DigitalCommons@CalPoly - Anna Gold: Data Curation and Libraries: Short-Term Developments, Long-Term Prospects
This paper was prepared as background for a talk given at AGU 2009 on “Data & Libraries.” It summarizes the developments and events from late 2006 through early 2010 that are shaping library roles in scientific data curation while underscoring the range, complexity, and varying granularity of systems, actions, and efforts involved. The main conclusions are: (1) leaders of major research libraries have committed their institutions to support data curation. (2) The library profession has demonstrated significant conceptual progress in characterizing and understanding data curation both in theory and in practice. (3) There has been progress since 2006 in legitimizing library roles in data curation through formal education and certification programs as well as by integrating data curation into established library services and systems.
data-curation  libraries  science  data-management  future  evaluation  2010 
april 2010 by tsuomela
[cs/0208012] Online Scientific Data Curation, Publication, and Archiving
(Submitted on 7 Aug 2002)
Science projects are data publishers. The scale and complexity of current and future science data changes the nature of the publication process. Publication is becoming a major project component. At a minimum, a project must preserve the ephemeral data it gathers.
science  data-management  data-curation  computer  computing  project(Utenn) 
april 2010 by tsuomela
[cs/0502008] Scientific Data Management in the Coming Decade
(Submitted on 2 Feb 2005)
This is a thought piece on data-intensive science requirements for databases and science centers. It argues that peta-scale datasets will be housed by science centers that provide substantial storage and processing for scientists who access the data via smart notebooks.
science  data-management  data-curation  computer  computing  project(Utenn) 
april 2010 by tsuomela
SciDAC Review - Home Page
The SciDAC Review is a quarterly magazine that will share SciDAC projects, news, and achievements.
science  computing  data-management  scientific  management  data  government  project(Utenn)  research  journal  magazine  e-science 
april 2010 by tsuomela
DOE - Science - Office of Advanced Scientific Computing Research (ASCR) - Homepage
The primary mission of the Advanced Scientific Computing Research (ASCR) program is to discover, develop, and deploy the computational and networking tools that enable researchers in the scientific disciplines to analyze, model, simulate, and predict complex phenomena important to the Department of Energy.
science  computing  data-management  scientific  management  data  government  project(Utenn) 
april 2010 by tsuomela
SciDAC Scientific Data Management Center
The U.S. Department of Energy's Scientific Discovery through Advanced Computing (SciDAC) program brings together the nation's top researchers to tackle challenging scientific problems. The Office of Advanced Scientific Computing Research in DOE's Office of Science supports multidisciplinary SciDAC projects aimed at developing future energy sources, studying global climate change, accelerating research in designing new materials,improving environmental cleanup methods, and understanding physics from the tiniest particles to massive supernovae explosions.
science  computing  data-management  scientific  management  data  government  project(Utenn) 
april 2010 by tsuomela
Ensuring the Integrity, Accessibility, and Stewardship of Research Data in the Digital Age
As digital technologies are expanding the power and reach of research, they are also raising complex issues. These include complications in ensuring the validity of research data; standards that do not keep pace with the high rate of innovation; restrictions on data sharing that reduce the ability of researchers to verify results and build on previous research; and huge increases in the amount of data being generated, creating severe challenges in preserving that data for long-term use.

Ensuring the Integrity, Accessibility, and Stewardship of Research Data in the Digital Age examines the consequences of the changes affecting research data with respect to three issues - integrity, accessibility, and stewardship-and finds a need for a new approach to the design and the management of research projects.
data-curation  archive  access  computing  data  research  science  preservation  internet  data-management 
march 2010 by tsuomela

Copy this bookmark:





to read