A PPO reinforcement-learning agent that learns satellite station-keeping at the Earth–Moon L1 Lagrange point. The physics is a custom CR3BP (circular restricted three-body problem) engine, and a Three.js/WebGL viewer renders the trajectories live.
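For readers unfamiliar with the CR3BP, the following is a minimal sketch of the standard nondimensional equations of motion in the rotating frame, plus a bisection search for the L1 point. It is not the project's actual engine: the function names `cr3bp_accel` and `find_l1` and the mass ratio `MU = 0.01215` (an approximate Earth-Moon value) are assumptions made for illustration.

```python
import math

# Assumed value: approximate Earth-Moon mass ratio m2 / (m1 + m2).
MU = 0.01215


def cr3bp_accel(state, mu=MU):
    """Acceleration in the rotating frame, nondimensional CR3BP units.

    state = (x, y, z, vx, vy, vz); Earth sits at (-mu, 0, 0) and the
    Moon at (1 - mu, 0, 0).
    """
    x, y, z, vx, vy, vz = state
    r1 = math.sqrt((x + mu) ** 2 + y * y + z * z)      # distance to Earth
    r2 = math.sqrt((x - 1 + mu) ** 2 + y * y + z * z)  # distance to Moon
    ax = 2 * vy + x - (1 - mu) * (x + mu) / r1**3 - mu * (x - 1 + mu) / r2**3
    ay = -2 * vx + y - (1 - mu) * y / r1**3 - mu * y / r2**3
    az = -(1 - mu) * z / r1**3 - mu * z / r2**3
    return ax, ay, az


def find_l1(mu=MU, tol=1e-12):
    """Locate L1 on the x-axis by bisection.

    Between the primaries the x-acceleration of a particle at rest rises
    monotonically from negative to positive, so it has a single root.
    """
    lo, hi = 0.5, 1.0 - mu - 1e-6
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if cr3bp_accel((mid, 0, 0, 0, 0, 0), mu)[0] < 0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


if __name__ == "__main__":
    x_l1 = find_l1()
    print("L1 x-coordinate (nondimensional):", x_l1)
    print("residual acceleration at L1:", cr3bp_accel((x_l1, 0, 0, 0, 0, 0)))
```

L1 is an unstable equilibrium, so a small position or velocity error grows over time. That is why station-keeping there needs continual small corrections, which is the control problem the agent is trained on.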
Selected work