Proof of concept

Green summarisation

A proof of concept from the AI Capability and Enablement team.

The problem

Earlier proofs of concept showed that large language models (LLMs) produce high-quality text summaries. But LLMs run on cloud infrastructure and consume significant energy per call.

Defra owns the government's environmental agenda, so the energy cost of AI is a real consideration here, not a footnote. See Sustainability for how that ties to the Greening Government Commitments.

We wanted to find out whether smaller models, running locally, could do the same job at a fraction of the cost.

The hypothesis

Non-LLM transformer models can summarise documents and meeting transcripts to a comparable standard, while using fewer resources and running locally.

We set out to answer 3 questions:

can smaller, local models match LLM summarisation quality?
how do we measure summary quality objectively?
what are the cost and speed trade-offs between local models and cloud-based LLMs?

What we found

Smaller transformer models can summarise text, but quality depends on expectations.

For concise summaries, the tested models produced shorter output that was semantically similar to the LLM baseline. They captured the core meaning.

For longer, more detailed summaries, the transformer models could not match LLMs. Their output length limits made detailed summaries impractical.

Running models locally also keeps the source text on Defra infrastructure. That suits sensitive material such as meeting transcripts, which a public consumer tool should not see.

Check the defaults in Using data with AI before pointing any tool at real content.

Before this approach goes further, it needs a clear way to measure whether a summary is good enough to trust.

Limitations

This was a short spike, so the findings are early.

We tested only 2 models, and ran inference on CPU only, so the speed numbers would improve a lot on GPU.

The metrics we used, ROUGE and BERTScore, measure textual overlap but may not capture what makes a summary genuinely useful to a reader.

Project details

Duration: 8 January 2025 to 14 February 2025
Status: Completed
View the green summarisation project on GitHub

Contribute a pattern

If your team has a reusable approach you want to share, the AI Capability and Enablement team can help you write it up.

Email: AICapabilityAndEnablement@defra.gov.uk

Cookies on the Defra digital service manual