
RL-Building Generator
Agent-based Reinforcement Learning to Increase Housing Density in London
Project Overview
This research project addresses London's critical housing crisis through innovative application of agent-based reinforcement learning. With housing prices rising 130% between 2005-2023 and the city meeting only a fraction of its housing expansion targets, our study proposes an AI-driven solution to optimize urban density while maintaining livability standards.
The project combines advanced machine learning techniques with urban planning principles, utilizing multi-agent systems to identify optimal densification strategies. Through comprehensive site digitalization and intelligent agent behavior modeling, we create automated tools that can support planners, architects, and policymakers in making data-driven decisions for sustainable urban development.
Housing Crisis in London

Price Surge
130%
Housing price increase between January 2005 and January 2023
Low Delivery Rate
< 50%
Of original housing expansion plan met in 2023
London faces the highest rents in the country, with particularly acute challenges in areas like Waltham Forest, which has the fourth highest overcrowding rate in Outer London at 18% of homes classified as overcrowded. This project targets these critical areas for intelligent densification solutions.
Methodology

Site Digitalization
2D/3D mapping, land use analysis, building indexing
Agent Modeling
Multi-agent systems with ground and roof agents
RL Training
120,000 step training runs with reward optimization
Optimization
Density increase while maintaining livability
Our approach integrates comprehensive site analysis with intelligent agent-based modeling. The digitalization process captures multiple data layers including land use patterns, building functions, solar exposure, and spatial relationships through quadtree algorithms for empty space identification.
The multi-agent reinforcement learning system employs distinct agent types - ground agents for horizontal expansion and roof agents for vertical densification - each trained through extensive simulation runs to optimize housing density while preserving environmental quality and regulatory compliance.
Site Digitalization & Agent Types


Land Use Analysis

- • Residential areas identification
- • Commercial zone mapping
- • Green space preservation
Building Index

- • Height and density mapping
- • Function classification
Solar Analysis

- • Ground radiation mapping
- • Shadow impact assessment
Agent

Move

- • 6 directions
- • up,down,front,back,left,right
Occupy

- • 2 options
- • Occupied or not
Coodinate

- • 6 directions
- • up,down,front,back,left,right
Neighbor

- • 25 + 8 + 1 positions
- • Also check celltypes
If agent has neightbors underneath

- ✅AddReward(0.04f * underbuildingCell)
- ✅AddReward(2.0);(Exactly above)
- ❌AddReward(-2.0);(No exactly above cube)
If agent has neightbors surrounding

- ✅AddReward(1.0)
- ❌AddReward(-1.0)
- • The first cube always gets reward
If agent's footprint in residential area

- ✅AddReward(1.0)
- ❌AddReward(-1.0)
If agent's footprint away from building

- ✅AddReward(distance - 2.0)
If agent on ground

- ✅AddReward(0.4 - (0.1 * floor));
- • Only the first six cubes
BoundingBox Reward

- Compactness = volumeRatio - diagonalRatio
- ✅AddReward(Compactness > 0)
- ❌AddReward(Compactness < 0)
Mean Solar Index Reward

- ☀️MeanSunIndex = Sum(sun_map) / (width * length)
- SolarDelta = (MeanSunIndex - InitialMeanSunIndex) * 50 (negative)
- From the ground cast a ray to sun and check obstacles, if not exists, the value of this position in the sun_map +1.
- select winter solstice’s sunlight vector per hour in London as our sunvector input
Training Results



Orthogonal Building Plot


Non-orthogonal Building Plot


Plot without Greenery


Key Performance Metrics
Training Parameters:
- • Training Steps: 120,000 per run
- • Training Duration: ~1.2 hours
- • Site Dimension: 400 x 400 x 40m
- • Voxel Resolution: 8 x 8 x 8
Optimization Targets:
- • Maximize housing density
- • Improve sunlight access
- • Maintain plot ratio compliance
- • Preserve green spaces
Case Study: Waltham Forest

Site Selection Criteria
- Mixed land use covering residential, commercial, and green spaces
- Well-connected transportation networks and road junctions
- Sufficient site dimension (400x400x40m) for comprehensive analysis
- Representative of broader London housing challenges
Application & Optimization


Agent Performance

Training duration: 1.2 hours per 120,000 steps
Optimization Results

Comparison with Traditional Urban Planning
Process Stage | Traditional Planning | AI-Driven Approach |
---|---|---|
Site Analysis | Manual assessment of physical characteristics | Automated plot digitalization with 2D/3D mapping |
Contextual Studies | Analyze surrounding architecture manually | Comprehensive building index and distance mapping |
Program Development | Define use mix based on stakeholder input | Automated land use optimization |
Initial Massing | Preliminary diagrams considering solar/wind | Ground solar radiation mapping integration |
Training/Iteration | Manual refinement based on feedback | 120,000 steps automated training run |
Final Production | Human designer creates final design | AI-generated optimized urban layout |