Available Datasets
GARUDA provides access to three primary data domains: Climate, Carbon, and Finance. All datasets are available via API and as downloadable Parquet files.
Climate Data
Section titled “Climate Data”BMKG Climate Observations
Section titled “BMKG Climate Observations”Coverage: 100+ weather stations across Indonesia
Time Range: 2000–present (historical data available)
Update Frequency: Daily
Size: ~2.3 GB (2020–2024)
Schema:
station_id(String) — BMKG station identifiertimestamp(Timestamp) — Observation time (UTC)temperature_c(Float64) — Temperature in Celsiushumidity_pct(Float64) — Relative humidity percentageprecipitation_mm(Float64) — Precipitation in millimeterswind_speed_kmh(Float64) — Wind speedprovince(String) — Indonesian provinceisland(String) — Island name (e.g., “Jawa”, “Sumatera”, “Kalimantan”)
Example Query:
SELECT province, AVG(temperature_c) as avg_temp, SUM(precipitation_mm) as total_precipFROM climate_observationsWHERE timestamp > '2024-01-01'GROUP BY province;Carbon Data
Section titled “Carbon Data”Carbon Project Metadata
Section titled “Carbon Project Metadata”Coverage: 50+ registered carbon projects in Indonesia
Time Range: 2015–present
Update Frequency: Monthly
Status: Available in v0.3.0
Schema (Preview):
project_id(String) — Unique project identifierproject_name(String) — Project nameproject_type(String) — e.g., “forest conservation”, “renewable energy”province(String) — Project locationvintage_start(Date) — Credit vintage start datevintage_end(Date) — Credit vintage end datecredit_quantity(Int64) — Total credits issuedcredit_price_usd(Float64) — Market price
Finance Data
Section titled “Finance Data”Indonesian Market Data
Section titled “Indonesian Market Data”Coverage: Stock exchange, commodity prices, currency rates
Time Range: 2010–present
Update Frequency: Real-time (1-minute delay)
Status: Available in v0.3.0
Schema (Preview):
timestamp(Timestamp) — Quote timesymbol(String) — Ticker symbolprice_idr(Float64) — Price in IDRvolume(Int64) — Trading volumemarket(String) — Market name (IDX, commodity exchange, etc.)
Saka Calendar Reference
Section titled “Saka Calendar Reference”Coverage: Complete Saka Calendar mapping
Time Range: 2000–2050
Update Frequency: Static (annual updates)
Size: ~5 MB
Schema:
gregorian_date(Date) — Gregorian calendar datesaka_sasih(String) — Saka month namesaka_pawukon(String) — Pawukon cyclesaka_eka(Int32) — Eka (day of cycle)
Example Query:
SELECT c.timestamp, c.temperature_c, s.saka_sasihFROM climate_observations cJOIN saka_calendar s ON DATE(c.timestamp) = s.gregorian_dateWHERE c.province = 'Bali'LIMIT 100;Geographic Coverage
Section titled “Geographic Coverage”All datasets are partitioned by:
- Province — 34 Indonesian provinces
- Island — 17,000+ islands grouped into major islands (Jawa, Sumatera, Kalimantan, Sulawesi, Papua, Nusa Tenggara, Maluku)
Query by geographic hierarchy:
SELECT * FROM climate_observationsWHERE island = 'Jawa' AND province = 'Jawa Barat';Data Quality
Section titled “Data Quality”All datasets undergo quality checks:
- Completeness: >95% non-null values per column
- Consistency: Schema validation on ingestion
- Timeliness: Updates within 24 hours of source publication
Quality metrics are available via the /data-quality endpoint (v0.3.0+).
Access & Licensing
Section titled “Access & Licensing”- Free Tier: Open Parquet downloads (CC BY 4.0)
- Developer+: API access with rate limits
- Enterprise: Custom data ingestion and SLA
See Pricing for details.
Coming Soon (v0.3.0)
Section titled “Coming Soon (v0.3.0)”- Interactive data catalog with coverage maps
- Data quality dashboards
- Custom data ingestion for enterprise customers
- Advanced analytics (reversal risk scoring, climate correlation)