Site Content Audit: What to Cut, Keep, and Fix
A site content audit is a systematic review of every page on your website, designed to assess whether each piece of content is serving a business purpose, earning search visibility, or doing both. Done properly, it tells you what to keep, what to improve, what to consolidate, and what to delete entirely.
Most sites accumulate content the way offices accumulate furniture: things get added when there’s a need, rarely reviewed, and almost never removed. After three or four years, you end up with hundreds of pages competing against each other, confusing search engines and offering readers a worse experience than a well-organised site with half the volume.
Key Takeaways
- A content audit is not a content inventory. Listing your pages is the starting point, not the output.
- Traffic alone is a poor measure of content quality. Pages with zero organic traffic can still drive conversions, and pages with high traffic can actively damage your brand.
- Cannibalisation is one of the most common and most ignored problems on mid-size websites. Multiple pages targeting the same keyword split authority and confuse ranking signals.
- The default answer for underperforming content should not be deletion. Consolidation and improvement usually deliver more value than removal.
- A content audit without a decision framework produces a spreadsheet, not a strategy. Every page needs a clear next action before the audit is useful.
In This Article
- Why Most Content Audits Produce a Spreadsheet and Nothing Else
- What Data Do You Actually Need Before You Start?
- How Do You Categorise Content for a Decision Framework?
- What Does Keyword Cannibalisation Look Like in Practice?
- How Should You Handle Content That Has Backlinks but Poor Performance?
- How Do You Prioritise the Improvement Queue?
- What Role Does Content Distribution Play in an Audit?
- How Often Should You Run a Content Audit?
- How Often Should You Run a Content Audit?
- What Are the Most Common Mistakes in a Content Audit?
Why Most Content Audits Produce a Spreadsheet and Nothing Else
I’ve sat in enough agency strategy reviews to know how this usually goes. Someone exports a crawl from Screaming Frog, pulls traffic data from Google Analytics, pastes it into a sheet with colour-coded columns, and presents it as a content audit. It isn’t. It’s a content inventory with a few metrics attached.
The difference matters. An inventory tells you what exists. An audit tells you what to do about it. Without a clear decision framework applied to every URL, the spreadsheet sits in a shared drive and nothing changes. The content problems that prompted the audit in the first place continue quietly eroding your search performance and your editorial credibility.
The reason audits stall is usually one of three things: the team lacks clear criteria for what “good” looks like, there’s no ownership over the decisions, or the findings are presented to someone who doesn’t have the authority or appetite to delete content that someone else spent time writing. Content has political weight inside organisations. Acknowledging that upfront makes audits more likely to produce results.
If you’re building or refining your broader content operation, the Content Strategy & Editorial hub covers the planning, governance, and measurement frameworks that sit around a piece of work like this.
What Data Do You Actually Need Before You Start?
You don’t need every metric available. You need the right ones, pulled cleanly, and mapped to each URL before you make any decisions. Overcomplicating the data layer is one of the fastest ways to make an audit feel unmanageable.
At a minimum, you need: organic sessions over a meaningful window (12 months is usually right, though 6 months works for fast-moving sites), keyword rankings and the intent behind the primary keyword each page targets, backlink data at the page level, conversion data where it’s trackable, and the date each page was last meaningfully updated. Secondary metrics worth adding include crawl depth, internal link count, and word count as a rough proxy for content depth.
SEMrush has a solid walkthrough of the data inputs and categorisation logic for a content audit if you want a reference framework for the technical setup. Moz’s piece on content planning and budget allocation is worth reading alongside it, because the decisions you make in an audit have direct resource implications that need to be planned for.
One thing I’d push back on: don’t over-rely on traffic as your primary signal. I’ve audited sites where the highest-traffic pages were ranking for informational queries that never converted, while mid-funnel pages with modest traffic were quietly driving a disproportionate share of leads. Traffic is a signal, not a verdict. You need to understand what the page is supposed to do before you judge how well it’s doing it.
How Do You Categorise Content for a Decision Framework?
Every page on your site should end up in one of four buckets: Keep, Improve, Consolidate, or Remove. The labels aren’t original. The discipline is in applying them consistently and being honest about which pages deserve which outcome.
Keep means the page is performing well against its intent, earning rankings, contributing to the user experience, and doesn’t need significant work. These pages usually need only light maintenance: updating statistics, refreshing internal links, or checking that the call to action still reflects current positioning.
Improve is the largest category on most sites. These are pages with genuine potential that are underperforming because of thin content, poor structure, weak internal linking, or misalignment between the page’s topic and the keyword it’s targeting. Improvement is the highest-effort category but typically delivers the strongest return, because you’re building on pages that already have some authority rather than starting from scratch.
Consolidate applies when you have multiple pages covering the same topic at a similar level of depth. This is a cannibalisation problem. Two pages targeting “content strategy templates” will compete against each other in search results, splitting link equity and sending mixed signals to Google about which page should rank. The solution is to merge the strongest elements of both into one definitive page, redirect the weaker URL to the stronger one, and update internal links accordingly.
Remove is the category people resist most. But some content genuinely has no future: pages targeting keywords with no commercial relevance, product pages for discontinued offerings, blog posts that were thin when they were written and have only aged badly since. Removing or redirecting these pages reduces crawl waste, tightens topical focus, and can improve the overall quality signals Google associates with your domain.
What Does Keyword Cannibalisation Look Like in Practice?
When I was running an agency that had been producing blog content at volume for several years without a coherent editorial strategy, we found over 40 instances of cannibalisation across a single client’s site. Not 40 pairs of competing pages. 40 clusters, some of which had four or five pages all targeting variations of the same query. The client’s organic traffic had plateaued despite consistent publishing, and this was a significant part of the reason.
Cannibalisation is easy to miss if you’re looking at pages in isolation. You need to map your content against keyword targets and look for overlap. A simple way to surface it: export your keyword rankings, group by primary keyword theme, and flag any theme where more than one URL appears in the top 20 positions. Then look at which URL Google is choosing to rank for that theme and whether it’s the page you’d actually want to rank.
Often, Google is ranking a page you didn’t intend to rank for a given term, because that page has accumulated more backlinks or internal links than the page you actually optimised. That’s a signal to consolidate, redirect, and repoint your internal linking to the correct destination.
How Should You Handle Content That Has Backlinks but Poor Performance?
This is where audits get genuinely complicated, and where the “just delete it” instinct can cause real damage. A page might have negligible organic traffic and thin content, but if it has accumulated backlinks from authoritative domains, removing it without a redirect destroys that link equity. The referring domains don’t disappear. They just stop passing value to your site.
The right approach for pages with meaningful backlink profiles but weak content is almost always to improve rather than remove. Rewrite the page, deepen the content, align it properly with search intent, and update internal links to point to it. You preserve the backlink value while giving the page a genuine reason to rank.
If the page genuinely can’t be salvaged because the topic is irrelevant to your current positioning, redirect it to the most topically relevant page on your site rather than returning a 404. A 301 redirect passes most of the link equity to the destination URL. It’s not a perfect solution, but it’s considerably better than losing the value entirely.
Moz’s content strategy roadmap from Whiteboard Friday covers how to think about content decisions in the context of a longer-term strategy, which is useful context for prioritising your improvement queue after an audit.
How Do You Prioritise the Improvement Queue?
After a thorough audit on a site of any meaningful size, you’ll have more improvement tasks than capacity to execute them. Prioritisation isn’t optional. Without it, teams default to working on the pages that feel easiest or most interesting rather than the ones with the highest commercial impact.
A simple scoring approach: weight each page by its potential traffic value (current ranking position multiplied by estimated search volume for the target keyword), its proximity to conversion (bottom-of-funnel pages score higher than informational content), and the effort required to improve it. Pages that score high on the first two dimensions and low on the third are your priority queue.
In practice, this usually surfaces a handful of pages that are ranking on page two for commercially valuable terms and need a focused improvement effort to break into the top five. These are your highest-leverage opportunities. A page ranking 11th for a term with genuine commercial intent will almost certainly deliver more value from a focused improvement effort than a page ranking 45th for a broad informational query.
The Content Marketing Institute’s framework for content marketing process and planning is worth referencing when you’re building the workflow around your improvement queue. The operational discipline of how work gets assigned, reviewed, and published matters as much as the strategic decisions about what to prioritise.
What Role Does Content Distribution Play in an Audit?
An audit focused purely on on-page factors and search performance misses something important: distribution. A page might be underperforming in organic search not because the content is weak but because it has no internal links pointing to it, has never been promoted through owned channels, and has attracted no backlinks as a result. The content isn’t the problem. The distribution is.
HubSpot’s guide to content distribution strategy is a useful reference for thinking about the promotion layer. When you’re reviewing pages in your audit, it’s worth flagging separately the pages that have distribution gaps versus the pages that have content quality gaps. The remediation is different for each, and conflating them leads to teams rewriting content that would have ranked fine with better internal linking and a few targeted outreach efforts.
Internal linking is consistently underestimated. I’ve seen sites where perfectly good content was sitting three or four clicks from the homepage with almost no internal links pointing to it. Google’s crawlers were barely finding it, and readers certainly weren’t. Adding a handful of contextual internal links from high-authority pages on the same domain can move rankings on those pages without touching the content itself.
How Often Should You Run a Content Audit?
How Often Should You Run a Content Audit?
For most sites publishing at a reasonable cadence, a full audit once a year is appropriate. Quarterly reviews of your top-performing and bottom-performing content are worth building into your editorial rhythm as a lighter-touch ongoing process. The goal is to catch problems before they compound rather than running a major remediation exercise every 18 months because the site has drifted.
Sites that publish at high volume or operate in fast-moving categories may need more frequent reviews. News-adjacent content, in particular, can become outdated quickly, and pages that were accurate when published can become actively misleading if left unchecked. The reputational cost of outdated content is harder to measure than the SEO cost, but it’s real.
One thing I’ve found useful is building a simple content health dashboard that surfaces pages with declining traffic trends month-over-month. You don’t need to action every decline immediately, but having visibility means you can catch pages that are slipping before they fall off the first page entirely. Catching a page dropping from position four to position seven is a much easier fix than recovering a page that has dropped to position 30.
If you want to go deeper on the strategic and operational frameworks that make content programmes sustainable over time, the Content Strategy & Editorial hub covers everything from editorial planning to measurement and governance.
What Are the Most Common Mistakes in a Content Audit?
Auditing without a clear objective is the most common. Are you trying to improve organic search performance? Reduce crawl budget waste? Align content with a repositioned brand? The objective shapes which metrics matter and which decisions you make. An audit optimised for search performance will reach different conclusions than one optimised for conversion rate.
Deleting content without redirects is a close second. I’ve seen this happen repeatedly, usually when a developer is tasked with “cleaning up” the site without a proper brief. Pages get removed, 404s accumulate, backlink equity evaporates, and the team wonders why organic traffic dropped after a site refresh. Always check backlink profiles before removing any URL, and always redirect rather than delete where there’s meaningful equity to preserve.
The third mistake is treating the audit as a one-time project rather than the beginning of an ongoing content governance process. The audit tells you the current state. What happens after is what determines whether the site’s content quality improves over time or drifts back to the same state in two years.
There’s also a tendency to focus the audit entirely on blog content and ignore service pages, landing pages, and product content. These pages often have more direct commercial impact and more significant quality issues, precisely because they were written quickly under commercial pressure and never revisited. A complete audit covers the entire site, not just the content marketing layer.
About the Author
Keith Lacy is a marketing strategist and former agency CEO with 20+ years of experience across agency leadership, performance marketing, and commercial strategy. He writes The Marketing Juice to cut through the noise and share what works.
