The Scenario
You inherited a spreadsheet from the lead-gen specialist who left last month. Column A has 150 "About Us" page URLs she had collected from prospect companies. The original plan was to enrich each one with company name, headcount, and founding year — data your sales team needs before outreach. The enrichment never happened. Now you're the one with access to Kadoa and a shared schema ID that's supposed to handle exactly this extraction, and a sales director who asked about the list in yesterday's standup.
The bad version:
- Open Kadoa, create an ad-hoc extraction job for the URL in row 2, wait for it to finish, copy the three fields from the results screen, paste them into columns B, C, and D.
- Repeat this 149 more times, except Kadoa times out on row 47 and you lose track of where you were, and by row 90 you're copy-pasting the wrong fields into the wrong columns because your eyes have crossed.
- Send the half-finished sheet to the sales director anyway with a note that you'll "finish it tomorrow."
The cognitive cost of doing this manually is not the time — it's the sustained, grinding attention it takes to not make errors across 150 identical micro-tasks. That's a budget nobody has.
The Easy Way: One Prompt in SheetXAI
SheetXAI is an AI agent that lives inside your Google Sheet. It reads column A, sends each URL to Kadoa using the schema you specify, and writes the extracted fields back into the same rows.
The prompt for this task:
For each URL in column A of this sheet (rows 2–151), run a Kadoa ad-hoc extraction using schema ID sch_xyz789 and write the extracted 'companyName', 'headcount', and 'foundingYear' fields back to columns B, C, and D.
What You Get
- Columns B, C, and D populated with the extracted values for each URL, one row per company.
- Rows where Kadoa could not extract a field left blank rather than filled with a placeholder, so missing data is visible rather than masked.
- A Status column added automatically flagging rows where the extraction failed (timeout, blocked page, no matching content) with a brief reason.
- The sheet ready to hand to the sales team without a manual review pass for formatting issues.
What If the Data Is Not Quite Ready
Some URLs redirect or return errors
Kadoa can hit redirects, bot-detection blocks, or 404s on certain pages. You want those flagged without stopping the rest:
For each URL in column A (rows 2–151), run a Kadoa ad-hoc extraction using schema ID sch_xyz789. Write the extracted 'companyName', 'headcount', and 'foundingYear' to columns B, C, and D. For any URL that returns an error or produces no results, write the error type to column E and leave B through D blank.
You also want to crawl one site and pull a list of subpages
One of the prospects has a complex site and you want to see what pages Kadoa can find before running the full extraction:
Crawl the site URL in cell B2, retrieve the list of crawled pages, and paste their URLs and page titles into a new sheet tab named 'Crawl Results'.
The schema ID varies by company type
Your sheet has a "Type" column (column E) — "tech", "retail", or "services" — and each type uses a different Kadoa schema:
For each URL in column A (rows 2–151), check the company type in column E. If it's 'tech', use schema ID sch_tech001; if 'retail', use sch_retail002; if 'services', use sch_serv003. Run the Kadoa extraction and write the results to columns B, C, and D.
Full pipeline: extract, validate completeness, and flag priority gaps for the sales team
For each URL in column A (rows 2–151), run a Kadoa ad-hoc extraction using schema ID sch_xyz789 and write 'companyName', 'headcount', and 'foundingYear' to columns B, C, and D. Then add a 'Completeness' column in E showing what percentage of the three fields were successfully extracted, and add a 'Priority Gap' flag in column F for any company where headcount is missing but founding year is over 10 years ago — these are the accounts the sales team most needs enriched manually.
One prompt does the extraction and surfaces the gaps that need a human touch — no separate quality check.
Try It
Get the 7-day free trial of SheetXAI and open any Google Sheet with a list of company URLs or prospect domains, then ask it to run Kadoa extractions across the list and write the results back. For related reading, see how to manage Kadoa schemas from a sheet registry or the Kadoa overview.
