Please use this identifier to cite or link to this item: http://ridi.ibict.br/handle/123456789/1264
metadata.dc.type: Trabalho apresentado em evento
Title: On building a tool for finding datasets based on a list of researchers or publications
metadata.dc.creator: Carvalho-Segundo, Washington
M. R. Dias, Thiago
metadata.dc.description.resumo: This proposal presents a tool developed in the Python language used to find related datasets of a list of researchers or publications. This tool was applied to a list of articles that a specific group of researchers had declared in their CVs. The target group was chosen based on the highest level that these researchers had obtained in a research productivity grant (1A). As a result, form a list of 1,227 researchers and more than 225 thousand deduplicated publications, it was possible to find 12,030 related datasets, were the most frequent access type is OPEN and the five most frequent related areas of research are Zoology; Chemistry; Genetics; Physics; and Agronomy. The proposed tool will be applied to facilitate populating the research data repository of the national funding agency in Brazil, but it can also be used in other more general contexts, extracting information from open databases, such as ORCID and Wikidata.
Abstract: This proposal presents a tool developed in the Python language used to find related datasets of a list of researchers or publications. This tool was applied to a list of articles that a specific group of researchers had declared in their CVs. The target group was chosen based on the highest level that these researchers had obtained in a research productivity grant (1A). As a result, form a list of 1,227 researchers and more than 225 thousand deduplicated publications, it was possible to find 12,030 related datasets, were the most frequent access type is OPEN and the five most frequent related areas of research are Zoology; Chemistry; Genetics; Physics; and Agronomy. The proposed tool will be applied to facilitate populating the research data repository of the national funding agency in Brazil, but it can also be used in other more general contexts, extracting information from open databases, such as ORCID and Wikidata.
Keywords: Open Science
Scientific Data Repositories
Scientific Publications
Open Data
metadata.dc.subject.cnpq: CNPQ::CIENCIAS EXATAS E DA TERRA::CIENCIA DA COMPUTACAO::METODOLOGIA E TECNICAS DA COMPUTACAO
metadata.dc.language: eng
metadata.dc.publisher.country: Brasil
Publisher: Instituto Brasileiro de Informação em Ciência e Tecnologia
metadata.dc.publisher.initials: IBICT
metadata.dc.rights: Acesso Aberto
URI: http://ridi.ibict.br/handle/123456789/1264
Issue Date: Jun-2021
Appears in Collections:Trabalhos apresentados em Congresso no Exterior

Files in This Item:
File Description SizeFormat 
OR2021_A_tool_for_finding_datasets_based.pdf217,62 kBAdobe PDFView/Open


This item is protected by original copyright