Real-Time Photogrammetry — TerrainEval, ltd.

Photogrammetry is a computational method for the creation of 3D maps from 2D aerial images. Map coverage area and desired quality are key factors in determining compute processing requirements for this method. Consideration of these factors reveals real-time processing limits beyond small coverage areas. We define "real-time" to be processing completed within the same timeframe as data acquisition - for example, a one-hour flight should produce results within one hour of landing.

Coverage Area Impact on Processing

In photogrammetry processing times vary dramatically based on coverage area. The relationship between ground coverage and processing time is not linear, it is cubic. This fundamental constraint must be faced in real-world deployments.

For areas under 10,000 m² (up to 2.5 acres), with 50-200 images at 1-3 cm ground resolution, processing in 15 minutes to 2 hours is achieveable depending on your hardware. A GPU setup can handle this in about 15-30 minutes, making near-real-time processing possible for very small sites.

As coverage expands to 0.01-0.05 km² (up to 12 acres), with 200-800 images, the runtime changes dramatically. Processing times range from 45 minutes on GPU systems to 8 hours on standard laptops. Real-time processing becomes impossible.

The scaling problem becomes severe for large coverage areas. Sites of 0.25-1 km² with 2,000-5,000 images require 6-18 hours, even on GPU systems. Standard workstations require multi-day processing times and become impractical. Beyond 1 km², distributed high-performance computing becomes a requirement.

Consider the Algorithm

Photogrammetry consists of three distinct computational phases.

Phase 1: Feature Detection and Matching

This phase identifies and matches distinctive features in overlapping images. This operation is well-suited to parallel processing as each image patch can be processed independently. SIFT (Scale-Invariant Feature Transform) and ORB (Oriented FAST and Rotated BRIEF) are computer vision algorithms that identify and describe distinctive keypoints in images for object recognition and feature matching. Modern GPUs can detect thousands of SIFT or ORB features simultaneously across image regions. While this phase scales linearly with image count and resolution, it typically represents only 10-20% of total photogrammetry processing time.

Phase 2: Bundle Adjustment: The Critical Constraint

Bundle adjustment optimizes camera positions, orientations, and 3D point coordinates through iterative non-linear optimization. This is where photogrammetry hits a wall. The algorithm's cubic complexity arises from repeatedly solving large systems of equations, where each iteration requires matrix factorization operations that scale with O(n³), where n is the number of images.

Normal matrix size ∝ (6n + 3m + k)²
where:
n = number of images
m = number of 3D points observed
k = intrinsic camera calibration parameters (typically 5–10)

Memory requirements scale quadratically while processing time scales cubically. No amount of hardware parallelization can overcome this algorithmic constraint because the problem structure itself requires sequential operations with global data dependencies. Modern incremental bundle adjustment techniques can reduce computational burden by fixing previously optimized parameters and only adjusting new observations. This reduces effective problem size from O(n³) to approximately O(w³) where w is a sliding window of recent images. However it also trades global accuracy for speed and still requires periodic global refinement passes.

Phase 3: Dense Reconstruction

Multi-view stereo reconstruction generates the final dense point cloud or mesh. This phase parallelizes well with GPUs through independent stereo matching operations. However, it can only begin after bundle adjustment completes, making it irrelevant to the real-time processing bottleneck.

CPU vs GPU for Photogrammetry

The photogrammetry community often debates CPU versus GPU processing. GPUs excel at parallel operations like feature detection and dense matching. This provides 15-45% overall performance improvement in typical workflows. Bundle adjustment - the dominant bottleneck - is fundamentally CPU-bound due to its sparse matrix operations, irregular memory access patterns, and sequential optimization requirements.

CPUs maintain advantages in:

Sparse matrix operations with irregular memory patterns
Complex branching logic in optimization routines
Support for large memory capacities (256GB+ RAM)

GPUs accelerate:

Parallel feature detection across image regions
Dense stereo matching with independent correlation windows
Image preprocessing and geometric transformations

Memory Requirements and Processing Times

Under 1,000 images: 64 GB RAM handles bundle adjustment matrices

1,000-8,000 images: 256 GB RAM becomes necessary due to quadratic scaling

Over 8,000 images: Exceeds single-machine capacity - distributed computing mandatory

20,000+ images: Requires specialized distributed infrastructure with coordinated memory management

Block-based partitioning with overlap regions, where the problem is split into subsets prior to global alignment, can parallelize bundle adjustment. This method reduces runtime by a factor of 4-8x at a cost of introducing block boundary artifacts.

Even with such optimizations a dataset of 8,000 images generates a bundle adjustment problem that will not fit in the memory of standard workstations. Such methods are most applicable to compute clusters which limit their real-time usability for field operations.

Photogrammetry Map Creation Times (NVIDIA RTX 4090 or similar + high-end CPU):

Image Count	Area	Bundle Adjustment	Total Time	Use Case
< 500	< 0.01 km²	5–30 min	15–90 min	Near-real-time
500–2,000	0.01–0.25 km²	30 min – 4 hrs	1–12 hrs	Same-day delivery
2,000–8,000	0.25–1 km²	4–20 hrs	6–48 hrs	Overnight batch
8,000–20,000	1–10 km²	20–100 hrs	2–7 days	Weekly processing
> 20,000	> 10 km²	100+ hrs	1–2 weeks	Special projects

What Real-Time Actually Means

Real-time photogrammetry with comprehensive reconstruction remains unavailable for coverage areas beyond small sites. Systems reporting real-time operation typically employ one of these methods.

Avoiding Bundle Adjustment: Using SLAM-style incremental pose estimation provides rapid results but sacrifices global accuracy. This works for navigation but not for survey-grade mapping.

Processing Subsets: Analyzing only key frames or image samples speeds processing at a loss of reconstruction quality and completeness.

Pre-computed Camera Poses: Using RTK/PPK GNSS camera position helps optimization but requires positioning hardware in the loop and still does not achieve real-time dense reconstruction.

Sparse Output Only: Generating only sparse point clouds for visualization - defers dense reconstruction for later processing.

Cutting-Edge Approaches

Emerging techniques like Neural Radiance Fields (NeRFs) and 3D Gaussian Splatting aim for faster reconstruction. 3D Gaussian Splatting, a method most prominently advertised by DJI, can train a scene representation in 5-30 minutes for small datasets and render at 100+ FPS once a model is trained. Model training requires initial camera poses, computed through traditional bundle adjustment. NeRFs and variants like Instant-NGP reduce training time to under an hour for small scenes but depend on pre-computed camera poses and struggle with large-scale outdoor environments.

Direct pose estimation using learned approaches eliminates bundle adjustment. Accuracy of these methods is currently worse than traditional methods. This accuracy loss is allowable for simple visuals and unacceptable for surveying and inspection applications. These cutting-edge approaches show promise for specific use cases but do not resolve the real-time reconstruction challenge for large-area mapping,

Operational Expectations

Live Monitoring (Real-Time)

Video streaming and telemetry
Basic visual odometry
2D mosaicking without 3D reconstruction

Rapid Processing (Hours)

Small site documentation (<500 images)
Emergency response mapping
Construction progress monitoring

Professional Analysis (Days)

Large area surveys
High-accuracy mapping projects
Survey grade dense reconstruction

Conclusion

Bundle adjustment constraints create a fundamental barrier to real-time photogrammetry for anything beyond small areas. Operations leveraging photogrammetry workflows must consider batch processing timelines and invest in appropriate computing workflows. Expectations should be set accordingly: real-time visualization is possible, real-time reconstruction is not.

Tell us what you are fielding.

Field testing, data collection, and integration for emerging technology.

Book a 30-minute call