Back to Portfolio
AI Product

Badan AI & Yield Prediction

Winner @ Mitr Phol GenAI Hackathon 2025

Badan AI & Yield Prediction

Impact

Spatial probability prediction & Agentic AI integration.

Overview

The Challenge: Blind Exploration

Water is the lifeblood of agriculture, yet finding sustainable groundwater sources (Badan water) remains a game of chance for many farmers. Traditional methods rely on costly geological surveys or pure intuition, leading to low success rates and wasted resources. For Mitr Phol, optimizing water yield is crucial for sugarcane productivity. We needed a way to peer beneath the surface without drilling a single hole.
Presentation Vibe

Figure: Presentation Vibe

Our Approach: Geospatial Intelligence

We built a predictive engine that treats the earth's surface as a dataset. By combining satellite imagery, topographical data, and historical yield records, we engineered a model capable of "seeing" water potential.
  • Satellite Data: Sentinel-2 imagery for vegetation indices (NDVI) and moisture levels.
  • Topography: DEM (Digital Elevation Model) data to analyze slope, aspect, and catchment areas.
  • Soil Composition: Geological maps digitized to understand permeability/porosity. Merging these layers allowed us to create a "Probability Heatmap" of the target areas, highlighting zones with the highest likelihood of accessible groundwater.

The Architecture

The system pipeline consists of three main stages: Data Ingestion, Processing, and Insight Delivery.
  1. Ingestion: Automated scripts pull daily satellite updates and weather station data.
  2. Probabilistic Modeling:
    • Badan Water Probability: We utilize a Gaussian Process (GP) classifier to infer the probability of groundwater presence. GPs allow us to model uncertainty explicitly—crucial when deciding where to drill in unexplored terrain.
    • Water Demand: A separate Regression Model forecasts the water demand for specific crop cycles, analyzing evapotranspiration rates and soil moisture deficits.
  3. Delivery: The results are stored in a PostGIS database, ready for querying.
    System Architecture Diagram

    Figure: System Architecture Diagram

    This modular architecture ensures that as new ground-truth data comes in from drilling logs, the model automatically retrains and improves its precision over time.

Technology Stack

We harnessed a modern stack to handle the complexity of geospatial ML and Agentic workflows:
  • Data Sources: Sentinel satellite data via Google Earth Engine, public government datasets, and internal Mitr Phol agricultural records.
  • Processing: GeoPandas for complex spatial manipulation and Pandas for high-volume tabular data.
  • Modeling: PyTorch and Scikit-learn for training our custom regression and Gaussian Process models.
  • Agentic Framework: OpenAI models orchestrated by LangChain, utilizing Chroma vector database for high-speed semantic retrieval.

Agentic AI: Democratizing Data

The most powerful model is useless if decision-makers can't use it. That's where our Agentic AI comes in. Instead of forcing field managers to interpret complex heatmaps or SQL queries, we built a natural language interface.
Overview of Badan AI

Figure: Overview of Badan AI

  • Conversational Querying: "Where should we drill in Plot A?" -> returns specific coordinates and confidence scores.
  • Scenario Planning: "If rainfall decreases by 20%, which wells are at risk?" -> simulates yield changes. This "Chat-to-Insight" layer bridges the gap between complex data science and actionable agricultural strategy, empowering every employee to make data-driven decisions instantly.
Machine LearningGeospatial DataAgentic AIAnalytics

Gallery Overview

Badan AI & Yield Prediction gallery image
Badan AI & Yield Prediction gallery image
Badan AI & Yield Prediction gallery image

Siwarat Laoprom © 2026