Aerial computer vision, at scale

GPU inference,
built for aerial CV teams.

Managed multi-tenant inference for teams running CV on gigapixel aerial imagery. Bring your own model or start with a tuned YOLO baseline. Per-tenant cost attribution — GPU-seconds, tiles, latency — baked into the API, not bolted on in Grafana.

YOLO + BYOM
Models from day one
Gigapixel
Native imagery support
Per-tenant
Real cost attribution
The platform

Watch a model run on a live corridor.

Tile-aware scheduling, georeferenced outputs, GPU sharing — running on Heronflux right now.

Live app.heronflux.com / jobs / utility-corridor-NE-04
insulator_damaged 0.94
vegetation 0.87
corrosion 0.71
insulator_damaged 0.89
utility-corridor-NE-04
modelyolov8x-utility:v3.2
tiles144 / 196 · 73%
detections1,847
gpuA10 · healthy
cost$0.18 / 1k tiles
41.8024°N, 93.2104°W
EPSG:4326 · GSD 2.4 cm/px
API

From import to first inference.

Python SDK, HTTP, or async webhook. Tag every job with a tenant — Heronflux meters per-tile and per-GPU-second automatically.

  • YOLO baseline ready, or upload your own ONNX / TorchScript
  • S3 / GCS / signed-URL inputs · GeoJSON + COG outputs
  • Per-tenant cost record on every job
# pip install heronflux
from heronflux import Client

client = Client(api_key=...)

job = client.jobs.create(
    model="heronflux/solar-defects-yolov11",
    inputs=["s3://acme/flights/2026-05/*.tif"],
    tenant="midwest-power-and-light",
    geo=True,
)

result = client.jobs.wait(job.id)
# result.detections — GeoJSON
# result.usage     — per-tenant cost & tiles
Why Heronflux

One platform. One workload. Tuned end-to-end.

Horizontal inference (Modal, Replicate, SageMaker) doesn't know what a tile is. Vertical drone-data SaaS doesn't expose the inference layer. Heronflux is the missing layer between them — built for aerial CV, and only aerial CV.

Engineered for inference economics

Tile-aware batching, weight caching across tenants, and GPU sharing with no quality loss. The substrate that makes per-customer inference viable instead of margin-eating.

Geospatial-native, not bolted on

Tiling, projection, and georeferencing are first-class. Inputs can be 50,000×50,000 pixels without you writing a single line of tiling code.

Multi-tenant isolation, real attribution

Per-tenant GPU quotas, weight isolation, and line-item cost attribution. Bill your end-customers what they actually consumed — to the cent.

YOLO baseline + bring your own

Start in minutes with a tuned YOLO baseline, or push your own ONNX, TorchScript, or container. Pin a version per tenant. Cold-start, weight caching, and rollouts handled — you focus on the model.

How it works

Three steps from model to georeferenced output.

Drop into an existing pipeline, not replace it.

01 / UPLOAD

YOLO baseline or your own

Start with the tuned YOLO baseline, or push your own weights as ONNX, TorchScript, or a container. Tag a version per tenant. Heronflux handles cold-start, weight caching, and rollouts.

02 / RUN

Point at imagery

COG, GeoTIFF, S3 URI, or a flight folder. Heronflux tiles with the right overlap, schedules across the fleet, and respects per-tenant quotas automatically.

03 / DELIVER

Real-world coordinates

Bounding boxes, masks, and classifications — all reprojected to the source CRS. GeoJSON, COG, or streamed to your pipeline via webhook.

Use cases

What teams build on Heronflux.

From solar O&M scans to autonomous perimeter security — anywhere aerial imagery hits a model and a tenant gets billed.

Solar O&M
Solar O&M
Defect & string-outage scans
Precision agriculture
Agriculture
Weed detection & crop scout
Perimeter security
Security
Perimeter & site monitoring
Towers & transmission
Inspection
Towers, transmission & roofs
Cost attribution

Know exactly what each customer cost you.

Heronflux tracks GPU-time, tile-count, and inference latency per tenant — automatically. Bill your end-customers from real consumption data, not estimates.

Stop over-provisioning to cover the long tail. Start running tight margins on a fleet that does only what it needs to.

Aerial solar farm

Run your CV models the way they were meant to run.

See how Heronflux handles your imagery, your models, your tenants. We'll walk you through a live workload in 30 minutes.