Aerial computer vision, at scale

GPU inference, built for aerial CV teams.

Managed multi-tenant inference for teams running CV on gigapixel aerial imagery. NVIDIA Triton inference, TensorRT acceleration. Start with our MIT-licensed drone-CV presets (trees, birds, livestock) or bring your own ONNX. Per-tenant cost attribution — GPU-seconds, tiles, latency — baked into the API, not bolted on in Grafana.


Presets + BYOM: MIT-licensed, no AGPL

Gigapixel: native imagery support

Per-tenant: real cost attribution

The platform

Watch a model run on a live corridor.

Tile-aware scheduling, georeferenced outputs, GPU sharing — running on Heronflux right now.

Live app.heronflux.com / jobs / utility-corridor-NE-04

insulator_damaged 0.94

vegetation 0.87

corrosion 0.71

insulator_damaged 0.89

utility-corridor-NE-04

model: byom/utility-inspection:v3.2

tiles: 144 / 196 · 73%

detections: 1,847

gpu: A10 · healthy

cost: $0.18 / 1k tiles

41.8024°N, 93.2104°W

EPSG:4326 · GSD 2.4 cm/px

API

From import to first inference.

Python SDK, HTTP, or async webhook. Tag every job with a tenant — Heronflux meters per-tile and per-GPU-second automatically.

Drone-CV presets ready, or upload your own ONNX
S3 / GCS / signed-URL inputs · GeoJSON + COG outputs
Per-tenant cost record on every job

# pip install heronflux
from heronflux import Client

client = Client(api_key=...)

job = client.jobs.create(
    model="heronflux/trees",  # preset, or "byom/"
    inputs=["s3://acme/flights/2026-05/*.tif"],
    tenant="midwest-power-and-light",
    geo=True,
)

result = client.jobs.wait(job.id)
# result.detections — GeoJSON
# result.usage     — per-tenant cost & tiles

Why Heronflux

One platform. One workload. Tuned end-to-end.

Horizontal inference (Modal, Replicate, SageMaker) doesn't know what a tile is. Vertical drone-data SaaS doesn't expose the inference layer. Heronflux is the missing layer between them — built for aerial CV, and only aerial CV.

Engineered for inference economics

Tile-aware batching, weight caching across tenants, and GPU sharing with no quality loss. The substrate that makes per-customer inference viable instead of margin-eating.

Geospatial-native, not bolted on

Tiling, projection, and georeferencing are first-class. Inputs can be 50,000×50,000 pixels without you writing a single line of tiling code.
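For a sense of what that tiling step involves, here is a minimal sketch of overlap-aware window generation. It is not Heronflux's scheduler: the function name `tile_windows`, the 1024-pixel tile size, and the 128-pixel overlap are all assumptions chosen for illustration.

```python
from typing import Iterator, List, Tuple

Window = Tuple[int, int, int, int]  # (x, y, width, height) in pixels


def _starts(size: int, tile: int, stride: int) -> List[int]:
    """Start offsets along one axis; the last window sits flush with the edge."""
    if size <= tile:
        return [0]
    starts = list(range(0, size - tile, stride))
    starts.append(size - tile)
    return starts


def tile_windows(width: int, height: int,
                 tile: int = 1024, overlap: int = 128) -> Iterator[Window]:
    """Yield overlapping windows that cover a width x height image.

    Hypothetical helper: tile size and overlap are illustrative defaults.
    """
    if not 0 <= overlap < tile:
        raise ValueError("overlap must be non-negative and smaller than tile")
    stride = tile - overlap
    for y in _starts(height, tile, stride):
        for x in _starts(width, tile, stride):
            yield x, y, min(tile, width - x), min(tile, height - y)


if __name__ == "__main__":
    windows = list(tile_windows(50_000, 50_000))
    print(len(windows), "windows; last:", windows[-1])
```

The overlap is there so an object cut by one tile edge appears whole in a neighbouring tile; duplicate detections in the overlap zone then need merging downstream, which this sketch does not cover.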

Multi-tenant isolation, real attribution

Per-tenant GPU quotas, weight isolation, and line-item cost attribution. Bill your end-customers what they actually consumed — to the cent.

Drone-CV presets + bring your own

Start in minutes with our MIT-licensed DeepForest presets (trees, birds, livestock), served via NVIDIA Triton. Or upload your own ONNX export (RetinaNet, YOLO, RT-DETR, anything Triton can serve) and deploy it on a dedicated GPU endpoint of your chosen tier: small, standard, or fast. Cold starts, TensorRT engine caching, and per-deployment scale-to-zero are handled for you.

How it works

Three steps from model to georeferenced output.

Drop into an existing pipeline; don't replace it.

01 / UPLOAD

Drone-CV preset or your own

Use a DeepForest preset (trees / birds / livestock) out of the box, or upload your own ONNX export. Pick a GPU tier per deployment: your model gets its own RunPod-backed Triton endpoint with TensorRT acceleration and scale-to-zero when idle.

02 / RUN

Point at imagery

COG, GeoTIFF, S3 URI, or a flight folder. Heronflux tiles with the right overlap, schedules across the fleet, and respects per-tenant quotas automatically.

03 / DELIVER

Real-world coordinates

Bounding boxes and classifications reprojected to WGS84. Export as GeoJSON, CSV, or KML — or receive a signed JSON payload at your webhook URL the moment a job finishes.
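To make the pixel-to-coordinates step concrete, the sketch below turns a pixel-space bounding box into a GeoJSON Feature using a GDAL-style six-element geotransform. It assumes the raster is already north-up in EPSG:4326, so no reprojection is shown; the function names and the sample pixel size are illustrative, not part of the Heronflux API.

```python
import json
from typing import Dict, Sequence, Tuple

# GDAL-style geotransform:
# (origin_x, pixel_width, row_rotation, origin_y, column_rotation, pixel_height)
GeoTransform = Tuple[float, float, float, float, float, float]


def pixel_to_lonlat(col: float, row: float, gt: GeoTransform) -> Tuple[float, float]:
    """Apply the affine geotransform to one pixel coordinate."""
    lon = gt[0] + col * gt[1] + row * gt[2]
    lat = gt[3] + col * gt[4] + row * gt[5]
    return lon, lat


def detection_to_feature(bbox_px: Sequence[float], label: str,
                         score: float, gt: GeoTransform) -> Dict:
    """Convert (xmin, ymin, xmax, ymax) in pixels to a GeoJSON Polygon Feature."""
    xmin, ymin, xmax, ymax = bbox_px
    # Counter-clockwise exterior ring for a north-up raster, closed on itself.
    corners = [(xmin, ymax), (xmax, ymax), (xmax, ymin), (xmin, ymin), (xmin, ymax)]
    ring = [list(pixel_to_lonlat(c, r, gt)) for c, r in corners]
    return {
        "type": "Feature",
        "geometry": {"type": "Polygon", "coordinates": [ring]},
        "properties": {"label": label, "score": score},
    }


if __name__ == "__main__":
    # Illustrative values only: origin near the demo corridor, made-up pixel size.
    gt = (-93.2104, 1e-6, 0.0, 41.8024, 0.0, -1e-6)
    feature = detection_to_feature((0, 0, 100, 100), "insulator_damaged", 0.94, gt)
    print(json.dumps(feature, indent=2))
```

Imagery delivered in a projected CRS (UTM, for example) would need an extra reprojection step to reach WGS84 before export.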

Use cases

What teams build on Heronflux.

From solar O&M scans to autonomous perimeter security — anywhere aerial imagery hits a model and a tenant gets billed.

Solar O&M

Defect & string-outage scans

Agriculture

Weed detection & crop scout

Security

Perimeter & site monitoring

Inspection

Towers, transmission & roofs

Cost attribution

Know exactly what each customer cost you.

Heronflux tracks GPU-time, tile-count, and inference latency per tenant — automatically. Bill your end-customers from real consumption data, not estimates.
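As a rough picture of what per-tenant rollup means, the sketch below aggregates per-job usage records into one cost line per tenant. The `JobUsage` fields and the `rollup` function are hypothetical; the actual shape of `result.usage` is not documented here.

```python
from collections import defaultdict
from dataclasses import dataclass
from typing import Dict, Iterable


@dataclass
class JobUsage:
    """Hypothetical per-job usage record; field names are assumptions."""
    tenant: str
    gpu_seconds: float
    tiles: int
    rate_per_gpu_second: float  # USD, e.g. a tier's per-second flex rate


def rollup(records: Iterable[JobUsage]) -> Dict[str, Dict[str, float]]:
    """Sum GPU-seconds, tiles, and cost per tenant."""
    totals: Dict[str, Dict[str, float]] = defaultdict(
        lambda: {"gpu_seconds": 0.0, "tiles": 0, "cost_usd": 0.0}
    )
    for r in records:
        line = totals[r.tenant]
        line["gpu_seconds"] += r.gpu_seconds
        line["tiles"] += r.tiles
        line["cost_usd"] += r.gpu_seconds * r.rate_per_gpu_second
    return dict(totals)


if __name__ == "__main__":
    jobs = [
        JobUsage("midwest-power-and-light", 100.0, 196, 0.00019),
        JobUsage("midwest-power-and-light", 200.0, 400, 0.00053),
        JobUsage("acme-solar", 50.0, 90, 0.00019),
    ]
    for tenant, line in rollup(jobs).items():
        print(tenant, f"{line['gpu_seconds']:.0f}s", line["tiles"], f"${line['cost_usd']:.4f}")
```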

Stop over-provisioning to cover the long tail. Start running tight margins on a fleet that does only what it needs to.

Deployments

Three GPU tiers. Pick one per model. Idle is free.

Upload your weights. Pick a tier. We provision a dedicated RunPod-backed endpoint for that (model, GPU) pair, scaled-to-zero by default — you only pay when a job is actually running.

small

NVIDIA L4 / A5000

24 GB VRAM · 1 GPU

$0.68/hr

$0.00019 / sec (flex)

DeepForest presets or small ONNX (~5–25 MB) — best for most aerial CV. Sub-second per tile on standard imagery.

standard

NVIDIA L40S

48 GB VRAM · 1 GPU

$1.91/hr

$0.00053 / sec (flex)

Mid-size ONNX (~50–150 MB) — larger tiles, denser scenes, lower latency under load.

fast

NVIDIA H100 PRO

80 GB VRAM · 1 GPU

$4.18/hr

$0.00116 / sec (flex)

Large ONNX (DETR-x, RT-DETR-x) and batch workloads. When throughput matters more than cost.

$0 when idle. Every deployment is created with workersMin=0. You can keep multiple deployments parked at no charge and pay only per second of actual GPU time. Transparent pass-through pricing plus a small Heronflux margin, itemized per job.
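The arithmetic behind scale-to-zero can be checked with the tier prices listed above. This sketch compares per-second flex billing with keeping an endpoint on around the clock; it leaves out the Heronflux margin, whose size is not stated, and the helper names are illustrative.

```python
# Hourly prices and per-second flex rates as listed for each tier above (USD).
TIERS = {
    "small": {"hourly": 0.68, "per_second": 0.00019},
    "standard": {"hourly": 1.91, "per_second": 0.00053},
    "fast": {"hourly": 4.18, "per_second": 0.00116},
}


def flex_cost(tier: str, gpu_seconds: float) -> float:
    """Cost of actual GPU time at the tier's per-second flex rate."""
    return TIERS[tier]["per_second"] * gpu_seconds


def always_on_cost(tier: str, hours: float) -> float:
    """Cost of keeping one GPU of this tier running for the given hours."""
    return TIERS[tier]["hourly"] * hours


if __name__ == "__main__":
    # A hypothetical day with ten minutes of real inference on the standard tier.
    print(f"flex:      ${flex_cost('standard', 600):.3f}")
    print(f"always-on: ${always_on_cost('standard', 24):.2f}")
```

Multiplying each per-second rate by 3,600 lands within a cent of the listed hourly price, so the two columns describe the same rate at different granularity.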

Built for engineering teams

Webhooks, change detection, asset hierarchy.

The integration plumbing engineering buyers expect from an inference platform — not an afterthought, not a vertical workflow trap.

Signed customer webhooks

HMAC-SHA256-signed POST to your URL the moment a job hits a terminal state. Header: X-Heronflux-Signature: sha256=…. Verify with a four-line snippet.
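A verification snippet might look like the sketch below. It assumes the signature is the hex-encoded HMAC-SHA256 of the raw request body under a shared secret, prefixed with `sha256=`; only the header name and prefix come from the description above, so treat the encoding and the function name as assumptions.

```python
import hashlib
import hmac


def verify_signature(secret: bytes, body: bytes, header_value: str) -> bool:
    """Check an X-Heronflux-Signature header against the raw request body.

    Assumes a hex digest; confirm the encoding before relying on this.
    """
    digest = hmac.new(secret, body, hashlib.sha256).hexdigest()
    expected = ("sha256=" + digest).encode("utf-8")
    # compare_digest runs in constant time, which avoids timing side channels.
    return hmac.compare_digest(expected, header_value.encode("utf-8"))


if __name__ == "__main__":
    secret = b"example-shared-secret"  # placeholder, not a real credential
    body = b'{"job_id": "example", "status": "succeeded"}'
    header = "sha256=" + hmac.new(secret, body, hashlib.sha256).hexdigest()
    print(verify_signature(secret, body, header))
```

Verify against the exact bytes received, before any JSON parsing or re-serialization, since a re-encoded body generally will not hash to the same value.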

Multi-flight change detection

Compare two completed jobs on the same project. We match detections across the two jobs by IoU and split them into matched / added / removed, which suits questions like "did this defect grow?"
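One way to picture IoU matching is the greedy sketch below. It is not the Heronflux implementation: the 0.5 threshold, the greedy highest-IoU-first strategy, and the function names are assumptions, and class labels are ignored for brevity.

```python
from typing import List, Sequence, Tuple

Box = Sequence[float]  # (xmin, ymin, xmax, ymax) in a shared coordinate system


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    iw = max(0.0, min(a[2], b[2]) - max(a[0], b[0]))
    ih = max(0.0, min(a[3], b[3]) - max(a[1], b[1]))
    inter = iw * ih
    union = (a[2] - a[0]) * (a[3] - a[1]) + (b[2] - b[0]) * (b[3] - b[1]) - inter
    return inter / union if union > 0 else 0.0


def diff_detections(before: List[Box], after: List[Box], threshold: float = 0.5
                    ) -> Tuple[List[Tuple[int, int]], List[int], List[int]]:
    """Return (matched index pairs, added indices in `after`, removed indices in `before`)."""
    pairs = sorted(
        ((iou(b, a), i, j) for i, b in enumerate(before) for j, a in enumerate(after)),
        reverse=True,
    )
    used_before, used_after, matched = set(), set(), []
    for score, i, j in pairs:
        if score < threshold:
            break  # remaining pairs overlap too little to count as the same object
        if i in used_before or j in used_after:
            continue
        used_before.add(i)
        used_after.add(j)
        matched.append((i, j))
    added = [j for j in range(len(after)) if j not in used_after]
    removed = [i for i in range(len(before)) if i not in used_before]
    return matched, added, removed


if __name__ == "__main__":
    flight_1 = [(0, 0, 10, 10), (100, 100, 110, 110)]
    flight_2 = [(1, 1, 11, 11), (200, 200, 210, 210)]
    print(diff_detections(flight_1, flight_2))
```

Boxes from both flights must be in the same coordinate system for this to work, which is why georeferenced outputs matter here. Answering whether a defect grew would additionally mean comparing the areas of each matched pair.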

KML asset hierarchy

Upload a KML of your towers, panels, or pads. Polygons / points / lines render alongside detections on the map. Tagging detections to assets is on the v1.1 roadmap.

Exports that fit your stack

GeoJSON for QGIS / ArcGIS. CSV for spreadsheets. KML for Google Earth. Print → PDF for client deliverables.

Per-tenant cost attribution

Every job carries a tenant tag. Per-tile and per-GPU-second metering rolls up to a real cost line — bill your end-customer from consumption, not estimates.

Human-in-the-loop dismiss

Mark false positives with one click. Exports and downstream counts respect dismissed rows. Restore anytime.
