recentpopularlog in

tsuomela : research-data   39

[1710.00027v1] Toward a System Building Agenda for Data Integration
"In this paper we argue that the data management community should devote far more effort to building data integration (DI) systems, in order to truly advance the field. Toward this goal, we make three contributions. First, we draw on our recent industrial experience to discuss the limitations of current DI systems. Second, we propose an agenda to build a new kind of DI systems to address these limitations. These systems guide users through the DI workflow, step by step. They provide tools to address the "pain points" of the steps, and tools are built on top of the Python data science and Big Data ecosystem (PyData). We discuss how to foster an ecosystem of such tools within PyData, then use it to build DI systems for collaborative/cloud/crowd/lay user settings. Finally, we discuss ongoing work at Wisconsin, which suggests that these DI systems are highly promising and building them raises many interesting research challenges. "
research-data  management  integration  sharing  data-science 
november 2018 by tsuomela
Frictionless Data: Making Research Data Quality Visible| International Journal of Digital Curation
"There is significant friction in the acquisition, sharing, and reuse of research data. It is estimated that eighty percent of data analysis is invested in the cleaning and mapping of data (Dasu and Johnson,2003). This friction hampers researchers not well versed in data preparation techniques from reusing an ever-increasing amount of data available within research data repositories. Frictionless Data is an ongoing project at Open Knowledge International focused on removing this friction. We are doing this by developing a set of tools, specifications, and best practices for describing, publishing, and validating data. The heart of this project is the “Data Package”, a containerization format for data based on existing practices for publishing open source software. This paper will report on current progress toward that goal."
research-data  data-curation  analysis  methods 
may 2018 by tsuomela
RDA Research Data Collections WG Recommendations | RDA
"Recent efforts of the Research Data Alliance have established a conceptual model for the management of research data that promotes the use of digital objects, transcending the traditional notion of files and decoupling questions of access and use from location and storage. In this context, the need for building aggregations or collections of such objects has become an essential element. However, contemporary work on object collections focuses on primarily describing such collections through metadata, whereas research data management practice requires not only to describe collections, but to make them actionable by automated processes to be able to cope with ever increasing amounts and volumes of data. To this effect, this recommendation provides a comprehensive model for actionable collections and a technical interface specification to enable client-server interaction. It also reports on first adoption and implementation efforts across communities and institutions and provides perspectives on the use of data types in connection with collection structures, highlighting pathways for possible future work."
research-data  collection  automation 
november 2017 by tsuomela
The State of Open Data Report 2017
"Figshare's annual report, The State of Open Data 2017, looks at global attitudes towards open data. It includes survey results of 2,300 respondents and a collection of articles from industry experts, as well as a foreword from Jean-Claude Burgelman, Head of Unit Open Data Policies and Science Cloud at the European Commission. Its key finding is that open data has become more embedded in the research community - 82% of survey respondents are aware of open data sets and more researchers are curating their data for sharing."
open-data  research-data  scholarly-communication  publishing  report 
november 2017 by tsuomela
Research Information Management: Defining RIM and the Library's Role
"Research information management (RIM) is the aggregation, curation, and utilization of information about research and is emerging as an area of increasing interest and relevance in many university libraries. RIM intersects with many aspects of traditional library services in discovery, acquisition, dissemination, and analysis of scholarly activities, and does so through the nexus with institutional data systems, faculty workflows, and institutional partners. RIM adoption offers libraries new opportunities to support institutional and researcher goals. In this paper prepared by Rebecca Bryant, OCLC Research Senior Program Officer, and a working group of librarians representing OCLC Research Library Partnership institutions, learn more about what RIM is, what is driving RIM adoption, and the library’s role in RIM."
research  research-data  libraries  metrics  scholarly-communication 
october 2017 by tsuomela
FORCE11 | The future of research communications and e-scholarship
"Printer-friendly version FORCE11 is a community of scholars, librarians, archivists, publishers and research funders that has arisen organically to help facilitate the change toward improved knowledge creation and sharing. Individually and collectively, we aim to bring about a change in modern scholarly communications through the effective use of information technology. "
scholarly-communication  publishing  communication  research-data  professional-association 
august 2017 by tsuomela
Data, a first-class research output
" The Make Data Count (MDC) project is funded by the Alfred P. Sloan Foundation to develop and deploy the social and technical infrastructure necessary to elevate data to a first-class research output alongside more traditional products, such as publications. It will run between May 2017 and April 2019. The project will address the significant social as well as technical barriers to widespread incorporation of data-level metrics in the research data management ecosystem through consultation, recommendation, new technical capability, and community outreach. Project work will build upon long-standing partner initiatives supporting research data management and DLM, leverage prior Sloan investments in key technologies such as Lagotto, and enlist the cooperation of the research, library, funder, and publishing stakeholder communities."
research-data  management  metrics  altmetrics  data-science  data  publishing  scholarly-communication 
may 2017 by tsuomela
The Realities of Research Data Management
"The Realities of Research Data Management is a four-part series that explores how research universities are addressing the challenge of managing research data throughout the research lifecycle. Research data management (RDM) has emerged as an area of keen interest in higher education, leading to considerable investment in services, resources and infrastructure to support researchers' data management needs. In this series, we examine the context, influences and choices higher education institutions face in building or acquiring RDM capacity—in other words, the infrastructure, services and other resources needed to support emerging data management practices. Our findings are based on case studies of four institutions: University of Edinburgh (UK), the University of Illinois at Urbana-Champaign (US), Monash University (Australia) and Wageningen University & Research (the Netherlands), in four very different national contexts. "
research  research-data  management  academic  libraries  report 
april 2017 by tsuomela
ckan - The open source data portal software
"CKAN is a powerful data management system that makes data accessible – by providing tools to streamline publishing, sharing, finding and using data. CKAN is aimed at data publishers (national and regional governments, companies and organizations) wanting to make their data open and available."
data-curation  data-publication  data  management  research-data  open-source  server  catalog  metadata  cataloging 
february 2017 by tsuomela
Science.gc.ca - Data Management
"The three federal research funding agencies – the Canadian Institutes of Health Research (CIHR), the Natural Sciences and Engineering Research Council of Canada (NSERC), and the Social Sciences and Humanities Research Council of Canada (SSHRC) – are committed to fostering a robust environment for data stewardship in Canada and internationally, and to helping ensure that Canada is well positioned to contribute to and capitalize on data-intensive science and scholarship. To achieve this, and while recognizing the diversity of data practices and needs within the Canadian research community, the agencies are in the process of reviewing and enhancing their data management requirements for agency-supported researchers. As a step in this process, the agencies have developed the Tri-Agency Statement of Principles on Digital Data Management (the Statement). This builds on the 2013 consultation document, Toward a Policy Framework for Advancing Digital Scholarship in Canada, information provided by a Comprehensive Brief on Research Data Management Policies, and advice received through stakeholder engagement. The agencies sought feedback from the research community in summer 2015 on a draft version of the Statement."
research-data  management  country(Canada)  funding  government  policy 
june 2016 by tsuomela
Home - Portage Network
"THE PORTAGE NETWORK is dedicated to the shared stewardship of research data in Canada through: Developing a national research data culture Fostering a community of practice for research data Building national research data services and infrastructure"
research  research-data  country(Canada)  organization  professional-association  libraries 
may 2016 by tsuomela
OHMS: Oral History Metadata Synchronizer
"The Louie B. Nunn Center for Oral History at the University of Kentucky Libraries has created a web-based, system called OHMS (Oral History Metadata Synchronizer) to inexpensively and efficiently enhance access to oral history online. OHMS provides users word-level search capability and a time-correlated transcript or indexed interview connecting the textual search term to the corresponding moment in the recorded interview online"
history  oral  digital-humanities  tool  transcription  sharing  research-data 
october 2015 by tsuomela
Home | Qualitative Data Repository
"QDR selects, ingests, curates, archives, manages, durably preserves, and provides access to digital data used in qualitative and multi-method social inquiry.  The repository develops and publicizes common standards and methodologically informed practices for these activities, as well as for the reusing and citing of qualitative data.  Four beliefs underpin the repository's mission: data that can be shared and reused should be; evidence-based claims should be made transparently; teaching is enriched by the use of well-documented data; and rigorous social science requires common understandings of its research methods."
data  data-curation  repository  qualitative  methods  social-science  research-data 
september 2015 by tsuomela

Copy this bookmark:





to read