A relevancy algorithm for curating earth science data around phenomenon

作者:Maskey Manil*; Ramachandran Rahul; Li Xiang; Weigel Amanda; Bugbee Kaylin; Gatlin Patrick; Miller J J
来源:Computers & Geosciences, 2017, 106: 164-170.
DOI:10.1016/j.cageo.2017.06.007

摘要

Earth science data are being collected for various science needs and applications, processed using different algorithms at multiple resolutions and coverages, and then archived at different archiving centers for distribution and stewardship causing difficulty in data discovery. Curation, which typically occurs in museums, art galleries, and libraries, is traditionally defined as the process of collecting and organizing information around a common subject matter or a topic of interest. Curating data sets around topics or areas of interest addresses some of the data discovery needs in the field of Earth science, especially for unanticipated users of data. This paper describes a methodology to automate search and selection of data around specific phenomena. Different components of the methodology including the assumptions, the process, and the relevancy ranking algorithm are described. The paper makes two unique contributions to improving data search and discovery capabilities. First, the paper describes a novel methodology developed for automatically curating data around a topic using Earth science metadata records. Second, the methodology has been implemented as a stand-alone web service that is utilized to augment search and usability of data in a variety of tools.

  • 出版日期2017-9