Key Facts
- ✓ The system ingests telemetry at approximately 100,000 messages per second from satellites, ground stations, weather radar, IoT humidity sensors, and space weather monitors.
- ✓ Machine learning models are trained on billions of data points from actual multi-orbit operations, combining deterministic physics with pattern recognition.
- ✓ Predictions are delivered 3-5 minutes in advance with over 90% accuracy, providing sufficient time for traffic rerouting before data loss occurs.
- ✓ The technology uses federated learning to aggregate patterns across constellations without sharing raw telemetry data, addressing privacy concerns.
- ✓ Current deployment options include on-premise air-gapped environments, government clouds (AWS GovCloud, Azure Government), and standard commercial clouds.
- ✓ The team includes engineers with experience at SpaceX managing Starlink constellation health, Blue Origin working on New Glenn test infrastructure, and NASA handling deep space communications.
Quick Summary
A team of former SpaceX, Blue Origin, and NASA engineers has developed an artificial intelligence system designed to predict satellite communication link failures before they occur. The technology aims to solve a critical problem in satellite operations where data loss often happens before operators can react to degrading signals.
By ingesting massive streams of telemetry data and applying advanced machine learning models, the system provides a proactive approach to satellite mission assurance, offering predictions up to five minutes in advance with high accuracy.
The Problem: Reactive Operations
The core challenge in satellite communications is that link degradation is often noticed only after data has already been lost. Traditional operations rely on reactive monitoring, where operators watch dashboards and manually reroute traffic when signal-to-noise ratios drop below thresholds. This approach becomes increasingly problematic as the number of satellites in orbit grows.
With approximately 10,000 satellites currently in orbit and projections exceeding 70,000 by 2030, manual intervention simply does not scale. The problem is compounded by the complex physics of satellite RF links, which are affected by dozens of interacting variables that change in real-time.
Key factors affecting satellite links include:
"We can predict most link failures 3-5 minutes out with >90% accuracy, which gives enough time to reroute traffic before data loss."
— Constellation Space Team
The Solution: AI-Driven Prediction
The new system ingests telemetry at approximately 100,000 messages per second from satellites, ground stations, weather radar, IoT humidity sensors, and space weather monitors. It runs physics-based models in real-time, calculating full link budget equations, ITU atmospheric standards, and orbital propagation to establish what should be happening under normal conditions.
Machine learning models are then layered on top, trained on billions of data points from actual multi-orbit operations. This hybrid approach combines deterministic physics with pattern recognition from historical data.
The system uses federated learning to address data privacy concerns common among constellation operators. Each constellation trains local models on its own data, and only high-level patterns are aggregated. This enables transfer learning across different orbit types and frequency bands—learnings from LEO Ka-band links can help optimize MEO or GEO operations.
We can predict most link failures 3-5 minutes out with >90% accuracy, which gives enough time to reroute traffic before data loss.
Technical Architecture
The system is fully containerized using Docker and Kubernetes, allowing flexible deployment across multiple environments. It can be deployed on-premise for air-gapped environments, on government clouds like AWS GovCloud and Azure Government, or on standard commercial clouds.
Currently, the technology is being tested with both defense and commercial partners. The dashboard provides real-time link health monitoring with forecasts at 60, 180, and 300-second intervals, plus root cause analysis that identifies whether degradation is caused by rain fade, satellite setting below the horizon, or network congestion.
Everything is exposed via API, including:
- Telemetry ingestion endpoints
- Prediction outputs
- Topology snapshots
- An LLM chat endpoint for natural language troubleshooting
Current Limitations
Despite its capabilities, the system faces several technical challenges that the team is actively working to address. Prediction accuracy naturally degrades for longer time horizons, with reliability becoming "dicey" beyond five minutes of forecast time.
Another significant challenge is the need for more labeled failure data for rare edge cases. Machine learning models require substantial training data, and infrequent failure modes present difficulties for comprehensive model training.
The federated learning architecture also requires careful orchestration across different operators' security boundaries. Balancing the benefits of pattern aggregation with strict security and privacy requirements remains an ongoing engineering challenge.
The hard parts we're still working on: prediction accuracy degrades for longer time horizons (beyond 5 minutes gets dicey), we need more labeled failure data for rare edge cases, and the federated learning setup requires careful orchestration across different operators' security boundaries.
Looking Ahead
The development represents a significant shift from reactive to proactive satellite operations management. As the orbital environment becomes increasingly crowded with mega-constellations, automated prediction and mitigation systems will become essential for maintaining reliable communications.
The team is actively seeking feedback from professionals experienced in satellite operations, RF link modeling, or large-scale time-series prediction. They are particularly interested in understanding what would make the system truly useful in a production Network Operations Center (NOC) environment.
With continued refinement of prediction algorithms, expansion of training data, and improved federated learning orchestration, this technology could become a cornerstone of modern satellite mission assurance infrastructure.
"The hard parts we're still working on: prediction accuracy degrades for longer time horizons (beyond 5 minutes gets dicey), we need more labeled failure data for rare edge cases, and the federated learning setup requires careful orchestration across different operators' security boundaries."
— Constellation Space Team









