The Content Audit Framework: What to Measure and What to Do With the Data
A content audit is a page-by-page evaluation of your entire published library against six performance dimensions: traffic, rankings, backlinks, engagement, conversion, and AI citation. The output is a categorized action list that tells you exactly which pages to keep, update, merge, or remove. Here is the complete framework for content managers running audits across 100 or more pages.
What Is a Content Audit Framework and Why Does It Matter?
- The 6 metrics every page gets evaluated against
- The tools and thresholds for each metric
- The 4-bucket categorization system that turns data into action
- The execution sequence for a team running this on 100 to 500 pages
What Are the Six Metrics Every Content Audit Must Measure?
Metric 1: Organic Traffic (Sessions Over 6 Months)
Pull GA4 session data for every URL over the trailing 6 months. Six months smooths out seasonal fluctuations and gives you enough data to identify trends. A page averaging 15 sessions per month is performing fundamentally differently from one averaging 500.- Source: GA4 Landing Page report, filtered to organic traffic
- What to flag: Pages with fewer than 50 total organic sessions over 6 months. On a site averaging 200 sessions per page, anything below 50 is in the bottom quartile and warrants investigation.
- Trend matters more than absolute numbers. A page with 80 sessions that is growing 15% month-over-month is healthier than a page with 300 sessions declining 10% monthly.
Metric 2: Keyword Rankings (Position Distribution)
Export ranking data from Google Search Console or a third-party tool like Ahrefs. For each URL, capture the number of keywords it ranks for and the position distribution across four tiers:- Positions 1-3: High-value rankings driving the majority of clicks
- Positions 4-10: Page-one visibility with moderate CTR
- Positions 11-20: Striking distance with optimization potential
- Positions 21+: Low visibility, typically below 1% CTR
Metric 3: Backlink Profile (Referring Domains)
Count the number of unique referring domains linking to each URL. Referring domains, not total backlinks, because 50 links from one domain carry less weight than 5 links from 5 different domains.- Source: Ahrefs, Semrush, or Moz. Export at the URL level.
- Threshold: Pages with 10 or more referring domains have meaningful link equity worth preserving. Pages with 0 referring domains have nothing to lose from removal or redirection.
- Quality check: Scan the referring domains list for spam. A page with 25 referring domains that are all PBN sites has a toxic profile, not a strong one.
Metric 4: Engagement Signals (Time on Page, Scroll Depth, Bounce Rate)
Engagement tells you whether visitors find the content useful after they arrive. Traffic without engagement means the page attracts clicks but fails to deliver value.- Average engagement time per session (GA4): Below 30 seconds on a 2,000-word article signals a content-quality problem
- Scroll depth (requires GA4 scroll event or Hotjar): Pages where 80% of users never scroll past the first 25% have a structural or introduction problem
- Bounce rate (GA4): Above 85% on a blog post is concerning. Above 90% is a red flag unless the page is designed for single-answer queries.
Metric 5: Conversion Contribution
Not every page needs to convert directly. But every page should play a role in at least one conversion path. Check two data points:- Direct conversions: GA4 key events attributed to the page as a landing page (form fills, demo requests, purchases)
- Assisted conversions: GA4 conversion path analysis showing the page as a touchpoint in multi-session journeys
Metric 6: AI Citation Potential
This is the metric most audit frameworks still miss. As of 2025, an estimated 40% of Google searches trigger AI Overviews, and platforms like ChatGPT, Gemini, and Perplexity are generating roughly 1 billion answers per day that cite external sources. Pages that get cited in AI responses receive referral traffic from a channel that did not meaningfully exist 18 months ago.- Does the page contain clear, definition-first paragraphs? AI systems extract concise definitions more readily than narrative introductions.
- Does it include structured data (FAQ schema, HowTo schema, Article schema)?
- Does it cite specific numbers with sources? AI models favor factual, attributed claims over unsourced assertions.
- Is the content entity-rich? Named frameworks, branded methodologies, and specific proper nouns increase citation likelihood.
What Tools Do You Need and What Thresholds Should You Set?
| Audit Metric | Tool | Threshold | Action If Below |
|---|---|---|---|
| Organic Traffic | GA4 | <50 sessions over 6 months | Candidate for removal or merge. Check remaining 5 metrics before deciding. |
| Keyword Rankings | GSC / Ahrefs | 0 keywords in positions 1-20 | Page is invisible. Evaluate whether topic has demand. If yes, rewrite. If no, remove. |
| Backlinks | Ahrefs / Semrush | <3 referring domains | Low link equity. Safe to remove or redirect without significant equity loss. |
| Engagement | GA4 / Hotjar | Avg. engagement time <30s on 1,500+ word pages | Content quality or structure problem. Rewrite introduction and reformat for scannability. |
| Conversion | GA4 Conversions | 0 direct or assisted conversions in 6 months | Add or reposition CTAs. If traffic is also low, deprioritize the page entirely. |
| AI Citation | Manual review / AI search tools | Score 1-2 on the 5-point scale | Add definition-first paragraphs, structured data, and source citations. Restructure for AI extractability. |
“The audit table is your decision engine. When a content manager can look up any page, see six numbers, and know exactly what to do next, you have eliminated the biggest bottleneck in content operations: the debate over what matters.”
Hardik Shah, Founder of ScaleGrowth.Digital
How Do You Categorize Pages Into Keep, Update, Merge, or Remove?
Category 1: Keep (No Action Required)
A page earns “Keep” status when it meets all of the following conditions:- Organic traffic above your site’s per-page median (calculate this from your GA4 export)
- At least 1 keyword ranking in positions 1-10
- Engagement time above 45 seconds
- Content is factually current (no statistics older than 24 months)
Category 2: Update (Refresh and Optimize)
Update is the largest category in most audits, typically 35% to 45% of all pages. A page gets categorized as Update when:- It ranks for relevant keywords in positions 4-30 (has ranking signals but is not maximizing them)
- Traffic is below the site median but above zero
- Content is partially outdated (statistics from 2022 or earlier, missing sections that competitors now cover)
- Engagement metrics are moderate (30-45 seconds average time)
Category 3: Merge (Consolidate Competing Pages)
Merge applies when two or more pages on your site target the same search intent and neither reaches its ranking potential because Google splits signals between them. The diagnostic criteria:- Two or more URLs share more than 60% of their GSC impression queries
- Google alternates which URL it ranks for the shared queries (check the Pages tab in GSC for each query)
- Neither page ranks in positions 1-5 despite the domain having sufficient authority for the topic
Category 4: Remove (Delete or Redirect)
Remove is the category content teams resist most, but it produces the clearest improvements in site-wide quality signals. A page qualifies for removal when:- Zero organic sessions for 12 consecutive months
- Fewer than 3 referring domains
- Zero presence in any conversion path (direct or assisted)
- The target topic has no viable search demand (cluster volume below 10 monthly searches)
- No other page on the site can absorb a redirect because the topic is irrelevant to the current content strategy
How Do You Run the Audit on 100+ Pages Without Losing Your Mind?
Step 1: Export Your Full URL List (30 Minutes)
Pull every published URL from your CMS or XML sitemap. Include the publish date, last-modified date, word count, and content type (blog post, landing page, resource page, product page). This is your master list. Every subsequent data pull gets merged into this spreadsheet by URL.Step 2: Bulk-Pull Performance Data (2-3 Hours)
Pull data from four sources and merge into the master spreadsheet:- GA4: Export the Landing Page report filtered to organic traffic. Get sessions, engagement time, bounce rate, and conversions for the trailing 6 months. GA4’s data export handles up to 5,000 rows natively.
- Google Search Console: Export the Pages report for 6 months. Get impressions, clicks, CTR, and average position per URL. Also export the Queries report per page for cannibalization detection later.
- Ahrefs or Semrush: Run a batch analysis on your full URL list. Get referring domains, domain rating of linking sites, and total keywords ranked per URL.
- Screaming Frog: Crawl the site to capture word count, title tags, meta descriptions, H1 tags, and internal link count per URL. This gives you the on-page data without manually checking every page.
Step 3: Score and Categorize (3-4 Hours)
Apply the thresholds from the audit table to every URL. Use conditional formatting in your spreadsheet to flag pages that fall below each threshold. Then apply the decision tree to assign each page to Keep, Update, Merge, or Remove. For the AI Citation metric, batch-review pages in groups of 20. Open each page, spend 60 seconds checking for definition-first structure, structured data, and source attribution. Score 1-5 and move to the next. At 60 seconds per page, 300 pages take 5 hours. Split this across two team members over two days.Step 4: Detect Cannibalization (2 Hours)
This step is separate because it requires comparing pages against each other, not evaluating them individually. Export GSC impression queries for every page that ranks for more than 5 keywords. Sort alphabetically by query. Any query that appears against two or more URLs is a cannibalization signal. Flag both URLs and compare their metric profiles to determine which should survive. Automated tools like Ahrefs’ Content Gap or Semrush’s Cannibalization report accelerate this, but manual validation is still required. Automated tools flag false positives when two pages legitimately target different intents for the same keyword.Step 5: Build the Action Plan (2 Hours)
Sort your categorized master list into four tabs: Keep, Update, Merge, Remove. Within each tab, sort by priority score (highest first). The priority score for Updates uses a simple formula: Priority = (Keyword Volume x Position Improvement Potential) + Conversion Value + Backlink Count This formula pushes high-volume striking-distance pages with conversion history and existing backlinks to the top of the Update queue. Those pages will produce the fastest, most measurable results. Total audit time for a 300-page site: 12 to 15 hours spread across 3 to 5 days. That is roughly 3 minutes per page, which is achievable when you batch each step instead of evaluating pages one at a time.How Do You Prioritize Updates When 150 Pages Need Work?
Tier 1: Striking Distance Pages (Positions 6-15, Volume 500+)
These pages are already ranking on the edge of page one or the top of page two. Moving them up 3 to 5 positions produces a disproportionate traffic increase because CTR jumps sharply between position 10 and position 5. A page ranking position 11 for a 2,000 monthly search keyword might receive 20 clicks per month. Moving it to position 5 produces 140 clicks per month. That is a 7x lift from a single content refresh.- Typical effort: 4 to 6 hours per page
- Expected timeline: 4 to 8 weeks for ranking movement
- Action: Expand depth to match top-3 competitors, update all statistics, strengthen the introduction with a direct answer, add 5 to 10 internal links from related pages
Tier 2: High-Conversion Pages With Moderate Traffic
Pages that appear in conversion paths but have room for traffic growth. These are typically bottom-funnel content: comparison pages, pricing guides, use-case pages, and service-specific landing pages. Improving their organic rankings directly increases pipeline because the traffic they attract has purchase intent.- Typical effort: 6 to 10 hours per page
- Expected timeline: 6 to 12 weeks for measurable conversion lift
- Action: Deepen content to capture long-tail variants, add comparison tables, include social proof (case studies, testimonials, specific results), and optimize CTAs for the conversion action
Tier 3: Decaying Pages (Lost 30%+ Traffic in 6 Months)
Pages that once performed well but are losing ground. The decay signals a competitive or freshness problem. Competitors published better content on the same topic, or the page’s data became too outdated for Google to consider it a strong result.- Typical effort: 8 to 12 hours per page (these often need substantial rewrites)
- Expected timeline: 8 to 16 weeks to recover former positions
- Action: Diagnose the cause of decay (check new competitors, content freshness, lost backlinks), then execute a heavy refresh targeting the gaps
How Does the Merge Process Work at Scale?
Step 1: Confirm Cannibalization
Not all pages targeting similar keywords are cannibalizing each other. Two pages can target the same broad topic but serve different intents. “What is a content audit” (informational) and “content audit template download” (transactional) share keywords but serve different users. Before merging, confirm all three signals:- GSC query overlap exceeds 60%
- Google alternates the ranking URL for shared queries
- Neither page reaches position 5 or better for the primary keyword
Step 2: Choose the Survivor
Compare the two pages across four factors: referring domains, total organic traffic, engagement time, and internal link count. The page that wins on 3 or more factors becomes the survivor. In cases of a tie, keep the page with more backlinks because link equity is the hardest signal to rebuild.Step 3: Extract and Integrate
Read the retiring page section by section. Move any content that exists only in the retiring page into the survivor. Do not duplicate content that already exists in the survivor. The goal is to make the survivor strictly better, not longer.Step 4: Redirect and Relink
- Implement a 301 redirect from the retired URL to the survivor
- Update every internal link pointing to the retired URL. Do not rely on the redirect for internal navigation. Direct links pass more equity than redirect chains.
- Update the XML sitemap to remove the retired URL
- If the retired page appeared in any email sequences, resource lists, or external content, update those references
Step 5: Monitor for 8 Weeks
Expect a brief ranking dip in weeks 1 to 2 as Google processes the redirect and reindexes the updated survivor. By week 4, the survivor should show ranking improvement. By week 8, the full consolidation benefit should be visible. If rankings have not improved by week 8, the issue is content quality on the survivor, not the merge itself. Process merges in batches of 3 to 5 per month. Spreading them out allows you to monitor each one individually and catch problems before they compound. On a site with 20 merge candidates, the full consolidation program takes 4 to 6 months.How Do You Handle the Remove Category Without Losing Traffic?
- The page has branded search value. Some pages rank for branded queries that do not show volume in keyword tools but generate consistent direct traffic. Check GSC for branded impressions before removing.
- The page is a linking hub. A page with zero traffic but 15 internal links pointing outward acts as a crawl path for other pages. Removing it without redistributing those internal links can reduce crawl frequency on the pages it linked to.
- The page has seasonal traffic. A gift guide that gets 0 traffic for 10 months but 5,000 sessions in November and December looks like a removal candidate in a July audit. Always check 12 full months.
The Safe Removal Protocol
- Pages with 0 backlinks, 0 traffic, 0 conversions: Return a 410 (Gone) status code. This tells search engines the removal is intentional and accelerates deindexing.
- Pages with 1+ backlinks but 0 traffic: 301 redirect to the most topically relevant active page. The backlink equity transfers to the redirect target.
- Pages where no relevant redirect target exists: 301 redirect to the parent category page or the blog index. Suboptimal, but preserves equity better than a hard delete.
- Total indexed pages in GSC. Should decrease by approximately the number of pages you removed.
- Total organic sessions across remaining pages. Should remain flat or increase. A decrease signals you removed something with hidden value.
- Crawl stats in GSC. Pages crawled per day should remain stable. Time spent downloading should decrease slightly as crawl budget is freed.
What Does the Quarterly Audit Cadence Look Like?
Quarter 1: The Baseline Audit
This is the full audit described in this framework. Every page scored, categorized, and prioritized. The output is your master content inventory with action assignments. Expect the first cycle to reveal 60 to 80 pages that need immediate attention across Update, Merge, and Remove.Quarter 2: Action + Incremental Review
Execute the highest-priority actions from Q1. Simultaneously, re-evaluate pages that were borderline between categories. A page that was barely in Keep during Q1 may have declined into Update. A page you updated in Q1 may have climbed into Keep. Add any new pages published during the quarter to the master inventory and score them.Quarter 3: Pattern Recognition
By the third cycle, you have 9 months of data. Patterns emerge. You can identify which content types consistently perform, which topics your site struggles to rank for regardless of content quality (indicating a domain authority gap), and which updates produced the largest ranking lifts. Use these patterns to refine your content strategy for the next 12 months.Quarter 4: Annual Review + Strategy Reset
The fourth quarterly audit doubles as an annual review. Compare your content inventory from Q1 to now:- How many pages moved from Update to Keep? (Measures refresh effectiveness)
- How many merges resulted in position 1-5 rankings? (Measures consolidation ROI)
- What happened to site-wide organic traffic after removals? (Validates the pruning strategy)
- Which new pages published this year are already in Keep? (Measures brief and targeting accuracy)
“We have run quarterly content audits for clients with 150-page sites and clients with 2,000-page sites. The framework scales. What changes is the time per cycle and the number of merge candidates. The decision logic stays identical because the metrics and thresholds are universal.”
Hardik Shah, Founder of ScaleGrowth.Digital
How Do You Turn Audit Data Into a Measurable Content Roadmap?
Roadmap Structure
Build the roadmap as a 13-week sprint plan aligned to your quarterly cadence. Allocate capacity across categories using this distribution:- Week 1: Finalize audit, build the roadmap, assign owners
- Weeks 2-3: Execute all Remove actions and Tier 1 Updates (striking distance pages). Early wins build momentum and stakeholder confidence.
- Weeks 4-8: Execute Merge actions (3 to 5 per month) and Tier 2 Updates (conversion pages)
- Weeks 9-12: Execute Tier 3 Updates (decaying pages) and any new content briefs that emerged from the gap analysis
- Week 13: Measure results, document learnings, prepare the next quarter’s audit
Success Metrics Per Category
Every action type has a defined success metric measured 90 days after execution:- Updates: Did the page’s organic traffic increase by at least 25% compared to the pre-update 90-day period?
- Merges: Does the survivor page now rank higher than either original page did for the shared keywords?
- Removes: Did site-wide organic traffic remain flat or increase in the 30 days following removal?
- Keeps: Did these pages maintain their performance without intervention? (If not, they should move to Update next quarter.)
Audit Your Content Library
We will score every page on your site across six performance dimensions, categorize each one into keep, update, merge, or remove, and deliver a prioritized quarterly roadmap your team can execute from day one. Talk to Our Team →