RL-Building Generator

Agent-based Reinforcement Learning to Increase Housing Density in London

Project Overview

This research project addresses London's critical housing crisis through innovative application of agent-based reinforcement learning. With housing prices rising 130% between 2005-2023 and the city meeting only a fraction of its housing expansion targets, our study proposes an AI-driven solution to optimize urban density while maintaining livability standards.

The project combines advanced machine learning techniques with urban planning principles, utilizing multi-agent systems to identify optimal densification strategies. Through comprehensive site digitalization and intelligent agent behavior modeling, we create automated tools that can support planners, architects, and policymakers in making data-driven decisions for sustainable urban development.

Housing Crisis in London

London Housing Crisis Data Visualization

Price Surge

130%

Housing price increase between January 2005 and January 2023

Low Delivery Rate

< 50%

Of original housing expansion plan met in 2023

London faces the highest rents in the country, with particularly acute challenges in areas like Waltham Forest, which has the fourth highest overcrowding rate in Outer London at 18% of homes classified as overcrowded. This project targets these critical areas for intelligent densification solutions.

Methodology

Site Digitalization

2D/3D mapping, land use analysis, building indexing

Agent Modeling

Multi-agent systems with ground and roof agents

RL Training

120,000 step training runs with reward optimization

Optimization

Density increase while maintaining livability

Our approach integrates comprehensive site analysis with intelligent agent-based modeling. The digitalization process captures multiple data layers including land use patterns, building functions, solar exposure, and spatial relationships through quadtree algorithms for empty space identification.

The multi-agent reinforcement learning system employs distinct agent types - ground agents for horizontal expansion and roof agents for vertical densification - each trained through extensive simulation runs to optimize housing density while preserving environmental quality and regulatory compliance.

Site Digitalization & Agent Types

Land Use Analysis

• Residential areas identification
• Commercial zone mapping
• Green space preservation

Building Index

• Height and density mapping
• Function classification

Solar Analysis

• Ground radiation mapping
• Shadow impact assessment

Agent

Action

Move

• 6 directions
• up,down,front,back,left,right

Occupy

• 2 options
• Occupied or not

Observation

Coodinate

• 6 directions
• up,down,front,back,left,right

Neighbor

• 25 + 8 + 1 positions
• Also check celltypes

If agent has neightbors underneath

✅AddReward(0.04f * underbuildingCell)
✅AddReward(2.0);(Exactly above)
❌AddReward(-2.0);(No exactly above cube)

If agent has neightbors surrounding

✅AddReward(1.0)
❌AddReward(-1.0)
• The first cube always gets reward

If agent's footprint in residential area

✅AddReward(1.0)
❌AddReward(-1.0)

If agent's footprint away from building

✅AddReward(distance - 2.0)

If agent on ground

✅AddReward(0.4 - (0.1 * floor));
• Only the first six cubes

BoundingBox Reward

Compactness = volumeRatio - diagonalRatio
✅AddReward(Compactness > 0)
❌AddReward(Compactness < 0)

Mean Solar Index Reward

☀️MeanSunIndex = Sum(sun_map) / (width * length)
SolarDelta = (MeanSunIndex - InitialMeanSunIndex) * 50 (negative)
From the ground cast a ray to sun and check obstacles, if not exists, the value of this position in the sun_map +1.
select winter solstice’s sunlight vector per hour in London as our sunvector input

Training Results

Orthogonal Building Plot

Before Training

After Training

Non-orthogonal Building Plot

Before Training

After Training

Plot without Greenery

Before Training

After Training

Key Performance Metrics

Training Parameters:

• Training Steps: 120,000 per run
• Training Duration: ~1.2 hours
• Site Dimension: 400 x 400 x 40m
• Voxel Resolution: 8 x 8 x 8

Optimization Targets:

• Maximize housing density
• Improve sunlight access
• Maintain plot ratio compliance
• Preserve green spaces

Case Study: Waltham Forest

Site Selection Criteria

Mixed land use covering residential, commercial, and green spaces
Well-connected transportation networks and road junctions
Sufficient site dimension (400x400x40m) for comprehensive analysis
Representative of broader London housing challenges

Application & Optimization

Agent Performance

Ground Agents 257.48 (smoothed value)

Roof Agents -648.56 (optimizing)

Training duration: 1.2 hours per 120,000 steps

Optimization Results

Density Optimization ✓ Achieved

Sunlight Access ✓ Improved

Regulatory Compliance ✓ Maintained

Green Space Preservation ✓ Protected

Comparison with Traditional Urban Planning

Process Stage	Traditional Planning	AI-Driven Approach
Site Analysis	Manual assessment of physical characteristics	Automated plot digitalization with 2D/3D mapping
Contextual Studies	Analyze surrounding architecture manually	Comprehensive building index and distance mapping
Program Development	Define use mix based on stakeholder input	Automated land use optimization
Initial Massing	Preliminary diagrams considering solar/wind	Ground solar radiation mapping integration
Training/Iteration	Manual refinement based on feedback	120,000 steps automated training run
Final Production	Human designer creates final design	AI-generated optimized urban layout