Agglomeration: Why Cities Exist and Why They Cost So Much

The Question

Why do firms and workers pay dramatically higher costs to locate in cities when they could operate more cheaply elsewhere — and who captures the surplus that clustering creates?

56%

World population in urban areas

UN, 2024

80%+

Global GDP produced in cities

World Bank

~33%

Raw urban wage premium

Glaeser & Maré 2001

β ≈ 1.15

Universal scaling exponent

Bettencourt et al. 2007

Figure 1

The Agglomeration-Congestion Tradeoff

This is the fundamental tradeoff of urban economics. Cities exist because clustering creates surplus — knowledge spillovers, thick labor markets, shared infrastructure. But clustering also creates costs — congestion, high rents, pollution. The question is not whether cities are good or bad. It is whether a specific city is the right size. To the left of the crossing, another person moving in creates net value. To the right, they impose net costs.

Why Firms Cluster — The Three Forces of Agglomeration

Alfred Marshall identified three forces in 1890. Rossi-Hansberg's course formalized them. Every city exists because at least one of these forces generates enough surplus to justify the costs of proximity.

Force 1: Knowledge Spillovers

Production externalities — firms learn from nearby firms

Ideas diffuse through proximity. This is why tech clusters in San Francisco, finance in New York, film in Los Angeles. Greenstone, Hornbeck & Moretti (2010)^[4] showed that when a large plant opens, nearby firms' productivity increases — even in different industries. The spillover is not about what you produce. It is about what you learn from the people next to you. Chatterji, Glaeser & Kerr found that entrepreneurship clusters are self-reinforcing: once a critical mass of startups exists, the next startup is more likely to succeed because of accumulated local knowledge.

Force 2: Thick Labor Markets

Labor pooling — better matching between firms and workers

A firm in a big city can find a specialist; a specialist in a big city can find a firm that needs them. This reduces friction and mismatch costs. Moretti's "Forces of Attraction"^[2] describes the multiplier: each additional skilled worker in a city raises the productivity of all other workers by improving match quality. The labor market becomes thicker — more options, better fits, faster reallocation when firms expand or contract. Rural labor markets are thin: one employer, take-it-or-leave-it. Urban markets are thick: many employers, bargaining power, specialization.

Force 3: Input Sharing

Supplier access — lower costs for inputs and outputs

Firms in cities face lower transportation costs for inputs and outputs. The classic comparison is Saxenian (1996)^[9]: Silicon Valley's dense network of specialized suppliers versus Route 128's isolated, vertically-integrated firms. Valley startups could prototype in weeks because every component supplier was within a 30-minute drive. Route 128 firms had to build everything internally. Proximity to suppliers and customers reduces costs and accelerates iteration. This force matters most for manufacturing and physical goods, less for pure services.

The Unifying Mechanism

Luis Bettencourt showed that these three forces are really one force — and that the relationship between city size and social output is mathematically precise and universal across cultures and centuries. Section 03 unpacks the quantitative framework.

Figure 2

Agglomeration Forces by Industry

Not all industries cluster for the same reason. Tech firms cluster to learn from each other (knowledge spillovers dominate). Finance firms cluster to access specialized talent (labor matching). Manufacturers cluster near suppliers (input sharing). The force that dominates determines which cities attract which industries — and explains why losing a single anchor firm can unravel an entire local cluster.

The Urban Wage Premium and Learning

The most striking empirical finding in urban economics: big-city workers earn dramatically more, and the premium is not just sorting.

Key Finding

De la Roca & Puga (2017)^[5] showed that workers in big cities earn more not just because of sorting — talented people moving to cities — but because cities accelerate learning. The wage premium grows with experience. A year of experience in New York is worth more than a year in Des Moines because you learn faster when surrounded by more and better peers. The city is not just paying you more. It is making you better.

Glaeser & Maré (2001)^[1] estimated the raw urban wage premium at roughly 33% in large metros. Even after controlling for education, experience, occupation, and industry, workers in big cities earn 4-11% more. The residual — the part that cannot be explained by worker characteristics — is the genuine agglomeration effect. Combes, Duranton, Gobillon & Roux (2012)^[10] confirmed this with French data: the productivity advantage of large cities is real, not an artifact of selection.

"The city makes the worker. Not the other way around. The wage premium grows with tenure because proximity compounds learning — every conversation, every observation, every accidental encounter at a coffee shop adds to the stock of human capital that isolation cannot replicate."

Figure 3

Urban Wage Premium by Metro Population

Part of the urban wage premium is sorting — talented people move to cities (the gap between the two lines). But even controlling for education, experience, and industry, big-city workers earn significantly more. That residual is the agglomeration effect: the productivity boost of being surrounded by other productive people. Glaeser and Maré (2001) estimate the raw premium at ~33% in large metros, falling to 4-11% after individual fixed effects. De la Roca and Puga (2017) show it compounds with experience — a year in NYC teaches more than a year in Des Moines.

The Scaling Laws — Bettencourt's Quantitative Framework

Luis Bettencourt took the agglomeration intuition and turned it into a measurable law. His Settlement Scaling Theory^[11] — developed at the Santa Fe Institute and now at Chicago's Mansueto Institute for Urban Innovation — shows that cities are not just big towns. They are a different kind of system entirely, and the difference is mathematically precise.

The Core Insight

When a city doubles in size, it does not simply get twice as much of everything. Socioeconomic outputs — patents, GDP, wages, but also crime and disease — increase by roughly 115%. Infrastructure needs — roads, electrical cables, gas stations — increase by only ~85%. The city gets a 15% bonus on everything social and a 15% discount on everything physical.

The scaling exponent β ≈ 1.15 is remarkably consistent — not a fixed constant, but a central tendency. It holds across US metros, European cities, Chinese prefectures, and ancient settlements, with individual exponents varying by indicator and dataset. Bettencourt's 2013 Science paper^[13] showed this is not an accident — it emerges from the geometry of social networks embedded in physical space.

Figure 4

Urban Scaling Exponents (β) — Bettencourt et al.

Bettencourt's scaling laws across ~360 US metropolitan areas. When a city doubles in population, socioeconomic outputs (patents, GDP, wages, crime) increase by roughly 115% — a 15% bonus from agglomeration. But infrastructure needs (roads, cables, gas stations) increase by only 83-87% — economies of scale from shared networks. Individual needs (housing, water, jobs) scale linearly. Strumsky, Bettencourt & Lobo (2023) confirmed these effects operate at sub-metropolitan scales too — counties and urbanized areas, not just MSAs. Data from Bettencourt et al. (2007, 2010, 2013, 2023).

What makes this more than a parlor trick is Scale-Adjusted Metropolitan Indicators (SAMIs)^[12] — Bettencourt's tool for asking: is this city performing above or below what its size predicts? Strip out the population effect and you get a dimensionless residual. San Francisco overperforms on patents. Memphis underperforms on income. The SAMIs persist for decades — city identity is structural, not accidental. Bettencourt found autocorrelation decay times of ~19 years for patents and ~35 years for income. Five distinct families of "kindred cities" emerged from the data, clustered not by geography but by economic model.

What This Explains

Why cities produce disproportionately more innovation per capita. Why infrastructure gets cheaper per person as cities grow. Why urbanization is the dominant trend of human history regardless of culture or era.

The Dark Side of 1.15

The same exponent applies to crime, pollution, and disease. Cities superlinearly produce everything social — good and bad. β = 1.16 for serious crime means doubling population increases crime by 116%, not 100%. The surplus and the pathology are the same phenomenon.

Congestion — The Price of Proximity

Every agglomeration benefit has a shadow. Commuting, housing costs, pollution, crowding, crime, inequality. Turner (2010) established the fundamental law of road congestion: building more roads induces proportional increases in driving. You cannot build your way out of traffic. Baum-Snow (2007) showed that highways caused suburbanization — redistributing people across space without reducing congestion at the center.

The key distributional insight: congestion costs are borne unevenly. High-skill workers capture agglomeration benefits through higher wages that more than compensate for higher rents. Low-skill workers bear congestion costs — longer commutes, worse air quality, higher rents — without proportional wage gains. This creates the urban inequality paradox: cities are where opportunity concentrates, but the cost of accessing that opportunity rises faster than the wages of the people who need it most.

The Distributional Problem

Hsieh & Moretti (2017)^[8] estimated that housing regulations in just New York, San Francisco, and San Jose lowered aggregate US GDP growth by 36% from 1964 to 2009. The mechanism: restrictive zoning prevents people from moving to high-productivity cities. Workers who would be more productive in SF stay in lower-productivity cities because they cannot afford the rent. The agglomeration surplus exists but is locked behind a housing gate — and the key is held by incumbent homeowners and landowners who benefit from artificial scarcity.

Figure 5

Who Captures the Agglomeration Surplus?

The fundamental distributional problem of cities: agglomeration creates enormous surplus, but roughly 60-70% is captured by landowners through higher property values and rents. High-skill workers capture 20-25% through wage premiums. Low-skill workers — who provide essential services that make cities function — often end up worse off after rent. Henry George identified this problem in 1879. The surplus is real; the question is who gets it.

City Size Distribution — Zipf's Law

An empirical regularity so robust it borders on a natural law: city sizes follow a power law distribution. The second largest city is roughly half the size of the largest, the third is roughly a third, and so on. This is Zipf's Law, and it holds across countries and time periods. Soo (2005) tested it across 73 countries. Gabaix (1999) provided a theoretical foundation.

Why does this matter for policy? Because it constrains what is possible. You cannot have ten cities the same size. The distribution is hierarchical by nature — a few very large cities, many medium cities, a long tail of small ones. Policy that tries to create a "second Silicon Valley" without understanding this regularity is fighting gravity. The rank-size distribution emerges from the same agglomeration-congestion tradeoff in Figure 1, applied across all cities simultaneously with random growth shocks (Gibrat's Law).

Figure 6

Zipf's Law: US Metro Size Distribution (log-log)

Zipf's Law is one of the most robust empirical regularities in economics. City sizes follow a power law distribution — the second-largest city is roughly half the size of the largest, the third is roughly a third, and so on. The rank-size relationship is remarkably stable across countries and centuries. This means city hierarchies are structural, not accidental. Policy that tries to create a "second Silicon Valley" without understanding this distributional regularity is fighting gravity. Soo (2005) tested Zipf across 73 countries and found the pattern holds nearly everywhere.

Metro Profiles

New York20.1Mmetro pop

Largest US metro — raw wage premium of +45%, adjusted +30% after controlling for worker characteristics.

Wage Premium

+45% raw / +30% adjusted

Zipf Deviation

Closely matches Zipf prediction for rank #1

Scaling Profile

Superlinear in patents, GDP, wages, and crime (beta approx 1.15)

Resilience

Diversified metro — high resistance and fast recovery post-shock

Dominant Force

Labor matching dominates: deepest talent pool in finance, media, and professional services

Comparison

SF matches on adjusted wage premium per capita but at 1/4 the population. Chicago earns less per person despite 50% of NYC population.

San Francisco Bay+32%adj. wage premium

Highest adjusted wage premium in the US — knowledge spillovers in tech drive superlinear output.

Wage Premium

+48% raw / +32% adjusted

Zipf Deviation

Slightly above Zipf prediction for rank #12

Scaling Profile

Overperforms on patents (SAMI positive for decades)

Resilience

Diversified but housing-constrained — Hsieh & Moretti estimate SF zoning alone costs significant GDP

Dominant Force

Knowledge spillovers dominate: tech cluster is self-reinforcing per Chatterji, Glaeser & Kerr

Comparison

Outperforms NYC on per-capita innovation despite 1/4 the population. Austin aspires to replicate but lacks the institutional density.

Chicago9.5Mmetro pop

Third-largest US metro — diversified economy with moderate wage premium and strong resilience profile.

Wage Premium

+26% raw / +16% adjusted

Zipf Deviation

Matches Zipf prediction almost exactly for rank #3

Scaling Profile

Linear scaling — diverse but no single superlinear sector dominates

Resilience

Diversified metro — finance, manufacturing, logistics, healthcare provide portfolio protection

Dominant Force

All three forces roughly equal: thick labor markets across sectors, knowledge base in finance and consulting, input sharing via logistics hub

Comparison

Lower per-capita premium than Boston or DC despite larger population — suggests size alone is not sufficient without specialization in high-spillover sectors.

Nashville+8%adj. wage premium

Fast-growing mid-size metro — modest adjusted premium but accelerating agglomeration dynamics.

Wage Premium

+14% raw / +8% adjusted

Zipf Deviation

Near bottom of top 35 at rank #35 with 2.0M — room to grow within Zipf distribution

Scaling Profile

Transitioning from linear to superlinear as healthcare and music industries reach critical mass

Resilience

Related variety in healthcare, entertainment, and logistics provides emerging diversification

Dominant Force

Labor matching is strengthening: healthcare sector creates thick specialized labor market

Comparison

Similar adjusted premium to Austin (+9%) but at smaller scale. Growth trajectory suggests Nashville is on the upward portion of the agglomeration curve (left of crossing in Figure 1).

Detroit4.4Mmetro pop

Single-industry cautionary tale — once 4th largest US city, now demonstrates non-recoverable shock pattern.

Wage Premium

Below national average after auto industry decline

Zipf Deviation

Below Zipf prediction — historically larger, now rank #14 and shrinking

Scaling Profile

Underperforms SAMI on income and patents — structural decline persists for decades per Bettencourt

Resilience

Non-recoverable quadrant: low resistance, slow recovery (see Figure 8)

Dominant Force

Input sharing was dominant (auto supply chain) — when the anchor firm left, the entire cluster unraveled

Comparison

Opposite of Munich or London: concentrated rather than diversified. The China Shock (Autor, Dorn & Hanson 2013) hit manufacturing-dependent metros hardest.

Austin+9%adj. wage premium

The "second Silicon Valley" candidate — growing fast but still below the agglomeration threshold of the original.

Wage Premium

+16% raw / +9% adjusted

Zipf Deviation

Rank #28 with 2.3M — well below the top tier, growth pushing it up the distribution

Scaling Profile

Emerging superlinear signatures in tech patents but not yet at SF levels

Resilience

Increasingly diversified: tech + government + university create related variety

Dominant Force

Knowledge spillovers are building but still thin compared to SF or NYC density

Comparison

Zipf distribution constrains how large Austin can get. Policy trying to create a "second Silicon Valley" fights the structural regularity of city size hierarchies.

Metro profiles synthesize data from Figures 1-8. Wage premiums from Glaeser & Maré (2001) and De la Roca & Puga (2017). Scaling from Bettencourt et al. (2007, 2010). Resilience from Martin et al. (2016) and Dijkstra et al. (2015). Zipf deviations computed against rank-size prediction. Click any card to expand.

Gentrification, Displacement, and Moving to Opportunity

Chetty & Hendren (2015)^[6] established one of the most important findings in modern economics: neighborhoods affect long-term outcomes. Growing up in a high-mobility neighborhood measurably improves children's adult earnings, college attendance, and likelihood of marriage. The effects are causal — this is not just selection. Where you grow up is an investment, not just a consumption choice.

But here is the tension that urban economics has no clean answer to: gentrification research shows that when capital reinvests in previously disinvested neighborhoods, it creates displacement pressure. The same market forces that make some neighborhoods "high-opportunity" — better schools, lower crime, higher property values — are the forces that price out the people who live there. See the housing aesthetics musing on how the reinvestment cycle standardizes the built environment.

We know neighborhoods matter for outcomes (Chetty). We also know that improving neighborhoods often displaces the people who live there (gentrification literature). Urban economics has no clean solution to this. The same force — agglomeration surplus flowing into land values — creates opportunity and destroys access to it simultaneously.

Figure 7

Neighborhood Effects on Adult Income (Chetty MTO Data)

Raj Chetty's Moving to Opportunity research showed that where you grow up changes what you earn as an adult. Children who moved to high-opportunity neighborhoods before age 13 earned roughly 31% more as adults — the effect is causal, not just selection. But moving after age 13 yields much smaller gains, and moving to another low-opportunity area yields almost nothing. Location is an investment in future human capital — but access to "good" locations is rationed by price. Whiskers show approximate 95% confidence intervals.

Climate Change and the Geography of Agglomeration

Desmet & Rossi-Hansberg (2015)^[7] modeled how climate change will reshape location decisions over the next century. Their central insight: the geography of agglomeration is not fixed. Cities in currently temperate zones will become more valuable as southern regions become less habitable. Coastal cities face compound risks — sea level rise, storm surge, flooding — that will erode the agglomeration surplus that currently justifies their existence.

The implication is a massive reallocation of economic activity over time. The Rust Belt — currently losing population to the Sun Belt — may reverse its decline as climate makes northern cities relatively more attractive. But agglomeration effects are path-dependent: once a cluster forms, it is self-reinforcing. Rebuilding agglomeration in new locations is far harder than maintaining it where it already exists. The question is whether the climate transition will be gradual enough for markets to adjust, or abrupt enough to strand capital and destroy existing clusters.

Spatial Resilience — Do Cities Bounce Back?

The standard agglomeration story implies cities should be resilient — diversified economies, thick labor markets, and knowledge spillovers should cushion shocks. The evidence is more complicated. Martin, Sunley, Gardiner & Tyler (2016)^[15] studied how UK regions responded to four major recessions over 40 years (1974–76, 1979–83, 1990–93, 2008–10). They found that economic structure influenced resistance and recovery, but "region-specific competitiveness effects" — human capital, social capital, institutional quality — mattered as much or more than density alone.

The 2008 financial crisis challenged the agglomeration-equals-resilience assumption directly. Dijkstra, Garcilazo & McCann (2015)^[16] found that European urban regions contracted more severely than intermediate and rural regions during the crisis. Capital metro regions — London, Madrid, Dublin — that had led pre-crisis growth became central to the problem. The same concentration that amplified gains amplified losses.

Figure 8

Urban Resilience Framework: Resistance vs. Recoverability

Stylized resilience framework based on Martin et al. (2016) and Dijkstra et al. (2015). Resistance measures how much employment dropped during recession; recoverability measures speed of return to pre-shock levels. Diversified metros (related + unrelated variety) cluster in the resilient quadrant. Single-industry cities — Detroit, Flint, Youngstown — sit in the non-recoverable quadrant. Financial centers like Dublin took large initial hits but recovered faster than manufacturing towns. Positions are illustrative of the pattern, not exact empirical coordinates.

What determines whether a city bounces back? Frenken, Van Oort & Verburg (2007)^[17] provided the key distinction: related variety versus unrelated variety. Related variety — industries that share knowledge bases — generates Jacobs externalities that drive employment growth. Unrelated variety — a portfolio of dissimilar industries — dampens unemployment during shocks. A city with both biotech and finance is more resilient than one with just biotech, not because finance helps biotech, but because when one sector contracts the other absorbs displaced workers.

COVID-19 tested this at unprecedented speed. Florida, Rodríguez-Pose & Storper (2023)^[18] argued the pandemic was unlikely to alter the "winner-take-all" economic geography of the global city system. Dense cities took larger initial hits — office vacancies, transit collapse, service sector layoffs — but their structural advantages (deep talent pools, institutional density, capital access) persisted. The forced remote-work experiment revealed that proximity still matters for coordination, mentorship, and trust-building, even when communication technology is free. Early evidence shows partial dispersal but persistent clustering for knowledge-intensive work.

The China Shock — Autor, Dorn & Hanson (2013) — provides the clearest evidence on technological and trade shocks. Rising Chinese imports between 1990 and 2007 caused higher unemployment, lower labor force participation, and reduced wages in manufacturing-dependent local labor markets. Import competition explained one-quarter of the aggregate decline in US manufacturing employment. The regions that never recovered were the specialized ones — single-industry towns without the variety to pivot. Agglomeration economies with diversified knowledge bases adapted; concentrated factory towns did not.

The Resilience Paradox

The same mechanism that makes cities productive — concentration — makes them fragile to shocks that hit their core industry. Resilience comes not from density per se but from diversity within density. A city with 10 million people in one sector is more vulnerable than a city with 5 million people across 20 sectors. The portfolio logic of unrelated variety operates at the urban scale exactly as it operates in financial markets: diversification does not prevent losses, but it prevents catastrophic ones.

Competing Explanations

New Economic Geography (Krugman)

Trade costs and increasing returns explain city formation. Nobel Prize 2008.

Cities emerge at points where transportation networks converge — ports, rail junctions, highway intersections. When transport costs fall below a threshold, firms concentrate to exploit scale economies. Correct about trade and transportation, but underweights knowledge spillovers. In a world where information moves at the speed of light, the model struggles to explain why firms still cluster in expensive cities when they could ship goods from anywhere.

Classical Agglomeration (Marshall / Rossi-Hansberg)

Knowledge spillovers, thick labor markets, and input sharing. The Booth 33454 framework.

The three forces are well-identified empirically — Greenstone, Hornbeck & Moretti (2010) for spillovers; Moretti (2012) for labor matching; Saxenian (1996) for input sharing. Ahlfeldt, Redding, Sturm & Wolf (2014) used the Berlin Wall as a natural experiment to identify agglomeration effects precisely. Strong on mechanisms, weaker on distributional consequences — who gets the surplus, who bears the costs.

Political Economy (Harvey / Brenner / George)

Cities as sites of capital accumulation. The distributional question is central, not peripheral.

Harvey's "spatial fix" — capital flows into the built environment when profit rates fall elsewhere, creating real estate bubbles and displacement cycles. Brenner's "new state spaces" — cities compete for mobile capital by offering tax breaks and deregulation. Correct about power and who captures surplus, but thin on the microeconomic mechanisms that create the surplus in the first place.

Settlement Scaling Theory (Bettencourt)

Cities as quantitative universals. Agglomeration derived from first principles.

Population density increases the rate of social interactions, which increase socioeconomic outputs superlinearly (β ≈ 1.15) while infrastructure scales sublinearly (β ≈ 0.85). Holds across cultures, centuries, and development levels — from Roman settlements to modern metropolises. Unifies Marshall's three forces under a single geometric mechanism. Strongest on universality and prediction, less focused on policy levers. The SAMIs metric gives cities a performance score independent of size.

What Would Falsify This?

If remote work permanently eliminates the urban wage premium — meaning the agglomeration effect was always just sorting, and proximity adds nothing once communication technology is good enough. Early post-COVID evidence suggests partial dispersal but persistent clustering for knowledge-intensive work.
If cities that restrict housing supply (San Francisco, London) outperform cities that allow growth (Houston, Tokyo) on per-capita productivity — meaning the housing constraint is not a binding limit on agglomeration benefits.
If Zipf's Law breaks down for a sustained period in a major country — meaning city size distributions are policy-driven rather than structurally determined. China's deliberate city-building is the most interesting test case.
If Chetty's neighborhood effects disappear in a large-scale replication — meaning location doesn't actually matter for human capital formation, and sorting explains all observed differences.
If a second tech hub achieves Silicon Valley-level innovation output without proximity — meaning agglomeration is not necessary for knowledge spillovers at the frontier. No candidate has yet succeeded.
If diversified cities recover no faster from shocks than specialized ones — meaning related variety provides no resilience advantage and the portfolio logic of urban economies is wrong. Post-China Shock and post-COVID data are the live tests.

So What?

Agglomeration economics is not an abstract theory about cities. It is the mechanism behind most of the economic phenomena that shape individual lives: why some places have opportunity and others do not, why housing is expensive where jobs are good, why inequality concentrates geographically, why some industries cannot exist outside of specific cities.

For Policy

Housing regulation is the most important agglomeration policy. Hsieh & Moretti's^[8] 36% GDP loss estimate means restrictive zoning costs more than most transfer programs deliver. The distributional fix is not to slow cities down but to build more housing in the places where agglomeration surplus is highest.

For Business

Location is a productivity input, not a cost center. The remote-work debate missed this: the question is not whether people can work from home but whether they learn as fast in isolation. For knowledge-intensive firms, agglomeration is the R&D budget you do not see on the balance sheet.

For Individuals

Where you live early in your career has compounding effects. De la Roca & Puga's^[5] finding — that a year of big-city experience is worth more than a year elsewhere — means the decision to stay or leave a major city is an investment decision, not a lifestyle choice. The returns are highest when you are young and learning fastest.

This connects to everything else I have written about. Bourdieu's habitus shows how location shapes perception — what you see as possible depends on where you grow up. The housing aesthetics piece explains why agglomeration-driven development standardizes the built environment — when land values rise, the economics of construction converge on the same design. The fertility economics work shows how the cost of living in agglomeration centers suppresses reproductive decisions — cities are where human capital compounds but also where the opportunity cost of children is highest.

Sources

[1]Glaeser, E. & Maré, D. "Cities and Skills." Journal of Labor Economics, 2001.Foundational estimate of the urban wage premium at ~33% in large metros. Even after controlling for worker characteristics, 4-11% residual remains — the genuine agglomeration effect.

[2]Moretti, E. The New Geography of Jobs. Houghton Mifflin Harcourt, 2012.Best empirical treatment of labor market agglomeration. The "multiplier" framework: each additional skilled job in a city creates roughly 5 additional local service jobs.

[3]Ahlfeldt, G., Redding, S., Sturm, D. & Wolf, N. "The Economics of Density: Evidence from the Berlin Wall." Econometrica, 2015.Natural experiment using the Berlin Wall to identify agglomeration and dispersion forces separately. Gold-standard causal identification of production and residential externalities.

[4]Greenstone, M., Hornbeck, R. & Moretti, E. "Identifying Agglomeration Spillovers: Evidence from Winners and Losers of Large Plant Openings." Journal of Political Economy, 2010.Shows that when a large plant opens, nearby incumbent firms experience productivity gains — direct evidence of knowledge spillovers operating across industries.

[5]De la Roca, J. & Puga, D. "Learning by Working in Big Cities." Review of Economic Studies, 2017.Separates sorting from learning. The wage premium grows with experience in big cities — a year in NYC teaches more than a year in Des Moines. Cities make workers more productive.

[6]Chetty, R. & Hendren, N. "The Impacts of Neighborhoods on Intergenerational Mobility." Quarterly Journal of Economics, 2018.Where you grow up changes your adult income — causally, not just correlationally. Children who moved to high-opportunity neighborhoods before age 13 earned ~31% more as adults.

[7]Desmet, K. & Rossi-Hansberg, E. "On the Spatial Economic Impact of Global Warming." Journal of Urban Economics, 2015.Dynamic spatial general equilibrium model showing how climate change will reshape the geography of agglomeration — northern cities gain relative value as southern regions become less habitable.

[8]Hsieh, C. & Moretti, E. "Housing Constraints and Spatial Misallocation." American Economic Journal: Macroeconomics, 2019.Estimated that housing regulations in high-productivity cities (NY, SF, San Jose) lowered aggregate US GDP growth by 36% from 1964-2009. The most cited paper on the economic cost of restrictive zoning.

[9]Saxenian, A. Regional Advantage: Culture and Competition in Silicon Valley and Route 128. Harvard University Press, 1996.Classic comparison of open networks (Silicon Valley) vs. closed vertically-integrated firms (Route 128). Explains why dense supplier ecosystems and porous firm boundaries accelerate innovation.

[10]Combes, P.-P., Duranton, G., Gobillon, L. & Roux, S. "Sorting and Local Wage and Skill Distributions in France." Regional Science and Urban Economics, 2012.French firm-level data confirming that the productivity advantage of large cities is real and not an artifact of selection. Workers in denser areas are genuinely more productive.

[11]Bettencourt, L., Lobo, J., Helbing, D., Kühnert, C. & West, G. "Growth, Innovation, Scaling, and the Pace of Life in Cities." PNAS, 2007.The paper that established urban scaling laws. Socioeconomic outputs scale superlinearly (β ≈ 1.15), infrastructure scales sublinearly (β ≈ 0.85). Foundational for Settlement Scaling Theory.

[12]Bettencourt, L., Lobo, J., Strumsky, D. & West, G. "Urban Scaling and Its Deviations: Revealing the Structure of Wealth, Innovation and Crime across Cities." PLOS ONE, 2010.Introduced Scale-Adjusted Metropolitan Indicators (SAMIs) — measuring city performance independent of size. Showed that city identity persists for decades with ~19-year autocorrelation for patents.

[13]Bettencourt, L. "The Origins of Scaling in Cities." Science, 2013.Derived scaling exponents from first principles: social network geometry embedded in physical space. The theoretical foundation explaining why β ≈ 1.15 emerges universally across cultures and centuries.

[14]Strumsky, D., Bettencourt, L. & Lobo, J. "Agglomeration Effects as Spatially Embedded Social Interactions." Environment and Planning B, 2023.Extended scaling analysis to sub-metropolitan scales — counties and urbanized areas, not just MSAs. Confirmed the mechanism is physical proximity, not administrative boundaries.

[15]Martin, R., Sunley, P., Gardiner, B. & Tyler, P. "How Regions React to Recessions: Resilience and the Role of Economic Structure." Regional Studies, 2016.Analyzed four UK recessions over 40 years. Found that region-specific competitiveness effects (human capital, institutions) matter as much as economic structure for resistance and recovery.

[16]Dijkstra, L., Garcilazo, E. & McCann, P. "The Effects of the Global Financial Crisis on European Regions and Cities." Journal of Economic Geography, 2015.European urban regions contracted more severely than intermediate and rural regions during the 2008 crisis. Challenges the assumption that agglomeration automatically provides resilience.

[17]Frenken, K., Van Oort, F. & Verburg, T. "Related Variety, Unrelated Variety and Regional Economic Growth." Regional Studies, 2007.The key distinction for urban resilience: related variety drives employment growth via Jacobs externalities, while unrelated variety dampens unemployment shocks via portfolio diversification.

[18]Florida, R., Rodríguez-Pose, A. & Storper, M. "Cities in a Post-COVID World." Urban Studies, 2023.Argued the pandemic was unlikely to alter the winner-take-all economic geography. Dense cities took larger initial hits but structural advantages in talent, institutions, and capital persisted.

Housing Development, Aesthetic Uniformity, and the Political Economy of Race and Capital

Why all new buildings look the same — and what it reveals about financialized capital, regulatory conformity, racialized reinvestment, and the erasure of place. Design homogeneity is not a taste failure. It’s a market equilibrium.

Read →

Musing

Habitus and the Inability to Dream

Bourdieu’s habitus explains why poverty isn’t just material deprivation — it’s a structure of perception that makes certain futures unthinkable. Marco Garrido’s Manila fieldwork shows the mechanism in action.

Read →

Personal Coda

I took Rossi-Hansberg's Urban Economics at Booth in Winter 2019. It was the course that made me understand cities not as places but as equilibria — the outcome of millions of location decisions that aggregate into the patterns we observe. Before that course, I thought expensive cities were just popular. After it, I understood that the price IS the mechanism — it is how agglomeration surplus gets allocated, and most of it goes to people who own land rather than people who create value.

The concept that stuck hardest was the agglomeration-congestion tradeoff. Every benefit of clustering comes with a cost, and the costs do not fall on the same people who capture the benefits. High-skill workers get the knowledge spillovers and the wage premium. Low-skill workers get the congestion and the rent burden. Landowners get the surplus. That is not a policy failure — it is the market working exactly as the theory predicts. The question is whether we accept that distribution or redesign it.

Rossi-Hansberg made us think about location as an investment, not a lifestyle choice. Where you live determines what you learn, who you meet, what opportunities you can even perceive. That connects to everything I have written about — Bourdieu's habitus, Chetty's neighborhood effects, the housing piece on why new buildings all look the same. Cities are the mechanism. Agglomeration is the force. The distribution of surplus is the political question.

Luis Bettencourt's work gave me the quantitative spine for what Rossi-Hansberg taught qualitatively. The scaling laws are not metaphors — they are measurements. That changes the conversation from "should we build cities" to "how do we distribute what cities inevitably produce." The surplus and the pathology come from the same mechanism. You cannot have the innovation without the congestion. The question is whether you design the distribution or let land markets do it for you.

urban-economicsagglomerationscaling-lawsBoothBettencourtMansueto