Free keywords:
distributed; data analysis; framework; TypeScript; RPC
Abstract:
The success of scientific projects increasingly depends on using data analysis tools and data
in distributed IT infrastructures. Scientists need to select appropriate tools and data,
extract patterns from the data using suitable computational resources, and interpret the
extracted patterns. Tools and data often reside on different machines: the volume of the
data often demands dedicated resources for storage and processing, and data analysis tools
usually require specific computational resources and run-time environments.
The data analytics software framework DASF, which we develop in Digital Earth (Bouwer
et al., 2022), enables scientists to conduct data analysis in distributed
environments.