Finding datasets to use in research

May 18, 2020

In empirical research, scientific data is a very important part. Due to modern technology, scientists are creating a much larger amount of scientific data. Many new tools and software developed that allow you to analyze, store, manage, and share this data. Scientific data is any information collected, by observation, through surveys, experiments, created or collected using a variety of data collection methods to validate research findings.

Research data is any information that has been collected, observed, generated or created to validate original research findings. Research data can take many forms. It might be: documents, spreadsheets, laboratory notebooks, field notebooks, diaries, questionnaires, transcripts, codebooks, audiotapes, videotapes, photographs, films, test responses, slides, artefacts, specimens, samples, collections of digital outputs, data files, database contents (video, audio, text, images), contents of an application (input, output, logfiles for analysis software, simulation software, schemas), methodologies and workflows, standard operating procedures and protocols.

Writing scientific papers you can to reuse existing data instead of collecting it yourself, there are good sources for potentially relevant existing data. Researchers, governments, and other institutions are increasingly making their datasets available to other researchers for re-use.There are thousands of data repositories worldwide, housing datasets from a wide range of research areas. Some repositories hold subject specific data, while others contain data from several different disciplines.

Find repositories in your research area.

Search the re3data (Registry of Research Data Repositories) to find relevant repositories.

re3data is a DataCite service. It provides information on over 2,000 research data repositories from every domain and in every country. You can browse by subject, country, or content type, and search by any combination of 41 different attributes.

The results in re3data.org provide a lot of information about repository. In this results you can find a general description of the repository, subject areas that it relates to, whether data in the repository is openly available, and whether terms of use or reuse licenses are specified for datasets in the repository.

re3data will help you find discipline specific repositories

DataCite will search across multiple repositories to find datasets related to your subject.

Other resources for finding data repositories and archives exist, such as the FAIRsharing databases catalogue and the Open Access Directory's list of data repositories

You can also search scientific datasets in multi-disciplinary repositories:
DRYAD – is a repository governed by a nonprofit membership organization. Several publishers partner with DRYAD to coordinate the submission of manuscripts with submission of data to DRYAD.
FigShare – is a repository owned by Digital Science. It is free to use for researchers.
Zenodo – data uploading, storage and access is free. This repository is linked to Horizon 2020 projects, and OpenAIRE. The European Commission funds this repository. Each data set assigned digital persistent identifiers (DOIs).
Mendeley.Data – is an open research data repository, where researchers can upload and share their research data. Datasets can be shared privately amongst individuals, as well as published to share with the world. Sharing research data is important for science as it enables data reuse and supports reproducibility of studies. Sharing data is also a fantastic way for you as a researcher, to gain exposure for your research outputs, as every dataset has a DOI and can be cited.

Others multi-disciplinary repositories:
4TU.ResearchData – archive for technical data.
B2SHARE
PANGAEA

National data repositories:
The National Open Access Research Data Archive (MIDAS) is a digital platform for storing, managing and publishing research data from various scholarly disciplines and for discovering research data created by others.

Search methods are different in each repository. We recommend finding a repository user guide that will allow you to perform the most optimal search and efficiently find data set what you interesting. Also, carefully evaluate the quality of the data source.

Research data is the same as any other type of publication, cite it as you would any other academic source: that is, with appropriate citations within the text and as an entry in your reference list.
A simple data citation format is:  Creator (Publication Year): Title. Version. Publisher. Resource Type. Identifier

Do you have any questions?

All the answers to your questions:
E-mail: infokonsultantai@vgtu.lt
Sent messages: VGTU library Facebook

 

Related news

Interdisciplinarity in practice: how Electronics and Medical Engineering students developed a Human motion analysis system
Interdisciplinarity in practice: how Electronics and Medical Engineering students developed a Human motion analysis system
Modern engineering solutions are increasingly created through collaboration between specialists from different fields. The university environment provides an opportunity to combine diverse competencies and develop solutions that would be difficult to achieve within the boundaries of a single discipline. Such collaboration was also at the heart of a bachelor’s thesis project in which VILNIUS TECH students Laura Venckutė (Faculty of Electronics) and Abderrazak El Aamrani (Faculty of Mechanics) combined expertise in electronics and medical engineering to develop a human motion recognition and evaluation system. From an Idea to Interdisciplinary Collaboration At the beginning of the project, students from the Medical Engineering and Electronics Engineering study programmes sought to address a problem relevant to both sports and rehabilitation: the lack of accessible systems capable of automatically evaluating human movements and providing immediate feedback. As the project authors point out, incorrect movements can reduce training effectiveness and increase the risk of injuries during rehabilitation, sports activities, or everyday tasks. For this reason, they decided to look for a technological solution that could help objectively assess movement quality. The idea emerged from previous projects and experience gained during their studies, while an important catalyst was the opportunity for collaboration proposed by their supervisors. From the outset, it was clear that the project would require expertise from different fields, as motion analysis involves not only developing a technical system but also defining meaningful criteria for evaluating human movement. [caption id="attachment_120707" align="alignnone" width="2048"] Electronics and Medical Engineering students developed a Human motion analysis system[/caption] The students brought different, yet closely interconnected and complementary competencies to the project. The Electronics Engineering student was responsible for computer vision, embedded systems, and system integration, while the Medical Engineering student contributed expertise in biomechanics and human movement assessment. Although responsibilities were divided according to individual areas of expertise, key decisions were made collaboratively. From the Initial Concept to a Functional System In the early stages, the team planned to develop a system capable of analysing a broader range of movement patterns and performing more advanced analytical functions. However, as the project progressed, technical limitations, available hardware resources, and the scope of the bachelor’s thesis had to be taken into account. As a result, some ideas had to be abandoned. According to the team members, no major disagreements arose during the project. Decisions were made by discussing possible alternatives, evaluating how well they aligned with the project objectives, and, whenever possible, testing different approaches in practice. When technical and medical requirements conflicted, the team sought solutions that best balanced project goals and implementation constraints. The final outcome of the project is a human motion recognition and feedback system based on a pose estimation algorithm designed for basketball shooting analysis. The system detects a person in real time, estimates body posture, evaluates shooting technique according to biomechanical criteria, and subsequently provides feedback to the user. The Value of Interdisciplinarity and Future Opportunities During testing, the system performed better than expected. It successfully analysed the movements of users of different heights and maintained reliable performance at distances of up to 12 metres. According to the students, not only did the technical results exceed expectations, but the collaboration process itself also proved highly successful. In their view, such a project could theoretically be completed by a specialist from a single field. In practice, however, this would be difficult and inefficient. The project required both expertise in electronics and an understanding of human movement analysis. Without competencies from both fields, considerably more time would have been needed for learning unfamiliar topics and identifying appropriate solutions. Looking ahead, the students see opportunities to further develop the project by improving system stability, optimising resource usage, expanding the range of supported movement patterns, and increasing motion recognition accuracy. Although they are not yet certain whether they will continue working in this specific area, they hope to further deepen their expertise in related fields of technology and engineering. Supervisors’ Insights: Interdisciplinarity as a Foundation of Future Engineering The thesis supervisors emphasise that the nature of the project itself required collaboration across disciplines. However, the greatest value of the project lies not only in the technical solution that was developed, but also in the students’ ability to work together effectively. Professor Kristina Daunoravičienė, lecturer in the Medical Engineering study programme, notes that developing a human posture recognition and evaluation system requires both an understanding of human movement and biomechanics, as well as the ability to create a technical system capable of collecting, processing, and presenting information to the user. „The need for different knowledge and competencies made this topic an excellent platform for collaboration between Medical Engineering and Electronics Engineering students. Such projects foster not only technical solutions but also the ability to understand the logic, limitations, and priorities of other disciplines,“ says Prof. Dr Kristina Daunoravičienė. Although the students were initially unfamiliar with one another and came from different engineering backgrounds, a shared goal quickly became the foundation of successful cooperation. According to the supervisor, Medical Engineering contributed the perspective of human movement assessment and result interpretation, while Electronics Engineering provided expertise in system architecture, prototyping, and optimisation. According to Prof. Dr K. Daunoravičienė, the most important outcome of the project is not only the developed prototype and its comparison with the Xsens motion analysis system: „Equally important are the competencies of collaboration, communication, trust, initiative, and the ability to learn from one another. These are the qualities that allow good ideas to become real, functioning solutions.“ Associate Professor Dr Vytautas Abromavičius of the Faculty of Electronics also points out that in the era of artificial intelligence, technical expertise alone is no longer sufficient. Clear communication, the ability to understand specialists from different fields, and working together towards a common goal are becoming increasingly important. „This bachelor’s thesis demonstrated that our students communicated exceptionally well and were able to explain specialised professional terminology in a simple and understandable way. This mutual understanding enabled them to effectively combine knowledge from different disciplines and achieve an excellent result,“ says Assoc. Prof. Dr Vytautas Abromavičius. According to him, the need for interdisciplinary projects in modern engineering continues to grow. Every real-world product developed for the market consists of multiple interconnected components; therefore, a broader understanding of the problem leads to better product applicability and a more complete final outcome.
More
VILNIUS TECH Professor A. Čenys Represents Baltic States at Google Leadership Summit
VILNIUS TECH Professor A. Čenys Represents Baltic States at Google Leadership Summit
Prof. Antanas Čenys, a prominent researcher at VILNIUS TECH and the SustAInLivWork project, participated by personal invitation from Google in the exclusive Google for Education Higher Education Leader Series EMEA in London. Prof. Čenys was the only AI and cybersecurity expert from the Baltic countries to be personally invited to this high-level summit. The exclusive event brought together higher education leaders, technology pioneers, and innovation stakeholders from across Europe, the Middle East, and Africa. The summit focused on shaping the future of Artificial Intelligence, digital transformation, cybersecurity, and driving responsible innovation within the global academic and industrial landscapes. The insights and discussions from the London summit strongly resonate with the core mission of the SustAInLivWork project: strengthening Europe’s capacity to develop, deploy, and scale trustworthy AI solutions while simultaneously building the advanced skills, critical infrastructures, and innovation ecosystems required for a sustainable digital future. Through SustAInLivWork, VILNIUS TECH and its partners are actively delivering: An International AI Cluster: bridging the gap between research excellence, industry, and public sector stakeholders; Advanced Innovation Services: driving AI and data-driven solutions for various sectors; AI Skills Development: establishing lifelong learning opportunities and specialized training; Cross-Regional Collaboration: accelerating practical AI adoption and ensuring positive societal impact. The summit also underscored the rapidly growing importance of cybersecurity as a fundamental pillar for secure AI deployment. This focus perfectly aligns with ongoing initiatives led by VILNIUS TECH, including specialized cybersecurity skills development programmes supported by Google.org, aimed at strengthening digital resilience and cyber competence across Europe. As Europe accelerates its comprehensive AI transformation, structured collaboration between universities, industry leaders, policymakers, and major technology providers becomes increasingly paramount. SustAInLivWork and VILNIUS TECH remain committed to contributing to this evolutionary journey by building strong bridges between cutting-edge research, thriving innovation ecosystems, and sustainable societal progress.
More