A recent Devpolicy Blog post, “The science of scale: what works to implement effective education programs?” (hereafter, “the post”) discusses a session on scaling education interventions held at the 2025 Australasian Aid and International Development Conference. As factors critical for achieving scale, the article emphasises the importance of evidence, leadership, evaluation and collaboration among policymakers, implementers and evaluators.
What is scale? The core idea of scale in international development is bringing benefits to more people. Scale is defined here as the geographical spread of benefits to more districts, schools, teachers and learners. The OECD’s Glossary of Key Terms in Evaluation and Results-Based Management for Sustainable Development does not define the term scale. Its absence is consistent with the Glossary’s editorial approach to only include OECD countries’ agreed technical terms in common use.
That absence of agreement is revealing. The post to which I am responding, for example, does not define scale and that reflects the muddled state of affairs with scale where most development projects neglect to define it. Scale is linked to the idea of the sustainability of benefits, which is defined in the OECD Glossary as the extent to which the benefits of a change continue. However, sustainability and scale are different concepts: sustainability is the time dimension of change; scale is the spatial dimension.
Is scale grounded in science? Clear definitions are foundational in science but are often missing in the development and educational literature, and in professional practice. Definitions establish the boundaries of the phenomenon being studied, making claims about it measurable and falsifiable. Measurement for evaluation and research cannot produce reliable and valid data without clarity about what is being measured.
The post’s assertion that theory alone is rarely sufficient in complex systems is true. However, theory is the bedrock of science. From the limited research on scale in education, Cynthia Coburn’s work provides an example of theory-building. She argues that only defining scale as reaching more beneficiaries overlooks the challenge for project design and implementation activities to achieve deep and lasting change. Coburn conceptualises scale as having four interrelated dimensions — depth of change, sustainability, spread and a shift in reform ownership toward local actors. Scale depends on sustainability and is only significant if change is sustained in both the original and subsequent intervention sites.
The post’s focus on scale is timely. Donors’ current policy settings are far from adequate to address global development challenges. For example, Australia’s current International Development Policy does not present a policy position on achieving sustainability or scale, and the Department of Foreign Affairs and Trade (DFAT)’s International Programming Guide provides no guidance on either. Yet the development policy is based on its “grand-scale” objective of a peaceful, stable and prosperous Indo-Pacific, and on the Minister’s commitment to the Sustainable Development Goals (SDGs) that begin with the goal of ending poverty in all its forms — “everywhere”.
However, in an example of policy incoherence, DFAT’s Design and Monitoring, Evaluation and Learning Standards does set out standards for sustainability, but not for scale. Sustainability and scale are conspicuously absent in Australia’s International Development Performance and Delivery Framework, where quality criteria are limited to effectiveness, efficiency, gender equality and disability equity. Effectiveness, the extent to which a development intervention achieves its objectives and results, is rarely a relevant criterion for evaluating sustainability and scale unless they are clearly specified as project objectives.
Why sustainability and scale are so often ignored is puzzling. Both are central themes in addressing the global development emergency, according to the United Nations (UN) Secretary-General in his foreword to the 2025 Sustainable Development Goals Report. Yet the reality is that they have not been central in development projects. That omission raises the question: if sustainability and scale are central to addressing global development challenges, why are they so rarely addressed?
Evidence from Indonesia’s education sector illustrates the significance of that question. Between 1971 and 2023, Indonesia received donor support through 114 education projects worth more than US$6 billion.
Analysis of donors’ reports from this support confirms the post’s opening assertion that ” … success at pilot level does not guarantee success at scale”. Donor completion reports judged 82% of their projects to be successful. However, success at project completion did not translate into sustainability or the scaling of benefits.
Only 23 of the 114 projects (20%) were evaluated at least two years after project completion for valid evidence of the actual sustainability of project benefits. Twelve of those 23 projects (52%) showed evidence of actual sustainability. Comparisons of donors’ attention to actual sustainability show that this dimension was evaluated by the Asian Development Bank in 11 of its 27 projects and by the World Bank in four of 44 projects. Australia evaluated one of its 25 projects for actual sustainability. The evidence for scale was far weaker, with only four of the 114 projects (3%) evaluated in this dimension.
The important place of political leadership in implementation is rightly stressed in the post. But political leadership can be unreliable due to short-term instability. Donors’ reports emphasise in-country educational leadership and local ownership of change to achieve sustainable benefits at scale. Educational leadership is critical to drive depth of change. Without practices informed by a depth of understanding of educational principles and local cultural values, there is a shockingly high risk of failing to achieve desired change, as Gerard Guthrie has demonstrated in his thorough analysis from all 142 developing countries.
Is scale grounded in science? The idea of a “science of scale” remains more aspirational than realised. The post’s conclusion that ” … the science of implementation and scale is finally taking off” is not supported by the evidence presented. Francis Fukuyama cautions about being “scientific” and ignoring culture in development in his book, State Building:
The effort to be more “scientific” than the underlying subject matter permits carries a real cost in blinding us to the real complexities of public administration as it is practiced in different societies (p. 123).
If educational development projects are to move beyond successful implementation to achieve lasting, sustained change at scale, they will need coherent donor policies that require, support and reward the professional work necessary for cultural relevance, clearer definitions, stronger theoretical and evidential foundations and excellence in project implementation and evaluation, including the evaluation of sustainability and scale after project completion.
And that matters. As the UN Secretary-General has pointed out, 800 million people still live in extreme poverty. Lifting people out of poverty requires focused interventions supporting benefits at scale from sustained access to essentials like food, clean water, healthcare, education and economic opportunities. The development policy drift away from a strong focus on the sustainability and scaling of benefits from development interventions places that poverty alleviation goal at risk.


