SAVIIA: Trusted Data Infrastructures for Field Science

Data Administration Platform
From data capture to decision: SAVIIA links local orchestration with cloud pipelines and governance so that field data can be trusted, reused, and scaled.

What is SAVIIA?

SAVIIA (Sistema de Administración y Visualización de Información para la Investigación y Análisis) is a trusted data infrastructure for UC’s Regional Field Stations Network (RCER). It integrates scientific (sensors, surveys, imagery) and operational data (logistics, maintenance, safety) through a common governance model and hybrid local–cloud architecture, enabling reproducible pipelines, auditable analyses, and secure sharing across teams.

Why it matters

Field research suffers from fragmented sources, intermittent connectivity, and uneven practices. SAVIIA addresses this by:

  • Standardizing capture, metadata, and storage so datasets are FAIR (findable, accessible, interoperable, reusable).
  • Bridging local constraints and cloud-scale analytics to support continuous, trustworthy workflows.
  • Connecting scientific and operational streams to improve data quality, reproducibility, and decision-making in stations and partner projects.

Architecture at a glance

  • Local Orchestrator: edge services for device/sensor ingestion, validation, caching, and scheduled exports; resilient to low/unstable connectivity.
  • Cloud Integration: versioned data lake + workflow engine for ETL/ELT, quality checks, and lineage; analytical sandboxes for teams.
  • Shared Models & Views: curated datasets, dashboards, and APIs for research, station operations, and stakeholders.

Data governance

SAVIIA implements lightweight, enforceable rules centered on:

  • Quality: schema checks, unit validation, completeness, timeliness SLAs.
  • Traceability: dataset identifiers, provenance, code–data–result linkage; reproducible notebooks/pipelines.
  • Access & Sharing: roles, project spaces, embargo/visibility policies; clear licensing for reuse.
  • Safety & Ethics: consent/acceptable-use registers; minimal exposure for sensitive fields.

What we’re building (deliverables)

  • SAVIIA Local: deployable edge stack (ingestion, validation, scheduling, secure sync).
  • SAVIIA Cloud: data lake + workflow templates; metadata registry; dataset catalogue.
  • Operations Kit: runbooks, templates, QA playbooks, and “hello-station” examples.
  • Visualization & APIs: curated views and endpoints for research and station management.
  • Training & Onboarding: short guides for station staff and researchers; governance checklists.

SAVIIA advances the state of practice by operationalizing governance and linking edge capture with cloud analytics, so field science can move from isolated datasets to reliable, decision-ready evidence at scale.

Project Team:

  • Director: Rodrigo A. Carrasco
  • Students: Sol Covacich, Pedro Zavala, Catalina Ortega
Rodrigo A. Carrasco
Rodrigo A. Carrasco
Associate Professor & the UC Data Science Initiative Director
Sol Covacich
Sol Covacich
B.S. in Engineering
Pedro Zavala
Pedro Zavala
B.S. in Engineering
Catalina Ortega
Catalina Ortega
B.S. in Engineering

Related