Create Jobs (Investigations)
Basic overview
You can create a job from the button in the upper-right corner of Index Worker.

Input fields
- Job name: Enter any clear, easy-to-understand name.
- Memo (optional): Enter anything you want to note.
- Search Console property: Select the Search Console property to investigate. (Google API limits allow up to 2,000 URLs per property per day.)
- URL type: Choose an XML sitemap, RSS feed, or manual input / CSV. For an XML sitemap or RSS feed, a URL field appears immediately below; enter the sitemap or feed URL there. For CSV, no header row is required. Prepare a file that lists only URLs separated by commas.
- Sampling period (RSS feed investigations only): Use this when you want to understand the crawl status of recent articles.
- Sampling size: You can specify up to 2,000 URLs. We recommend setting this to around 500.
- Schedule: Choose either “Schedule (recurring run)” or “Run once (immediate run).”
- Notes: Use this freely as a memo field.
- URL parameters to ignore: Specify these only when the investigation URLs contain parameters you want to ignore.
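To illustrate what ignoring a URL parameter can mean in practice, here is a minimal sketch that removes named query parameters from a URL. It is only an assumption about the general idea, not a description of how Amethyst processes URLs internally; the function name `strip_params` and the example parameter `utm_source` are hypothetical.

```python
from urllib.parse import parse_qsl, urlencode, urlsplit, urlunsplit


def strip_params(url, ignored):
    """Return `url` with the query parameters named in `ignored` removed.

    Hypothetical helper for illustration; not part of Amethyst.
    """
    parts = urlsplit(url)
    # Keep every query pair whose key is not in the ignore set.
    kept = [
        (key, value)
        for key, value in parse_qsl(parts.query, keep_blank_values=True)
        if key not in ignored
    ]
    return urlunsplit(parts._replace(query=urlencode(kept)))


print(strip_params("https://example.com/a?id=3&utm_source=x", {"utm_source"}))
# prints: https://example.com/a?id=3
```

With a rule like this, two URLs that differ only in an ignored parameter are treated as the same page.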
🧩 Important feature: Try specifying an XML sitemap and then pressing the “Specify sampling conditions” button.
For example, after specifying an XML sitemap that lists all URLs, you can add a sampling condition such as “investigate only a specific directory within that sitemap.”
📣 For an XML sitemap, you can also register a sitemap index file.
🎨 When creating a job, you can filter URLs by URL group. This is useful because you can specify the scope you want to investigate even when using an XML sitemap.
🖇️ You can investigate up to 2,000 URLs per day for each Search Console property. This is a Google API limit, not an Amethyst feature limit.
The limit is per property, not 2,000 per site.
🔬 When more than 2,000 URLs are configured in Index Worker, Amethyst performs appropriate sampling instead of a full investigation. It calculates index and crawl rates at a level reliable enough for practical use. Investigation result charts show a 95% confidence interval. (See the reference section on the 95% Wilson Confidence Interval.)
💡 Full investigations are not recommended. In many cases, a sample size of 500 is enough. One advantage of Amethyst is that it can calculate the index rate through sampling without investigating every URL.
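The idea of narrowing a sitemap to one directory and then investigating only a sample can be sketched as follows. This is an illustration under stated assumptions: it uses simple random sampling, which may differ from the sampling Amethyst actually performs, and the function name `sample_urls`, the `/blog/` prefix, and the example URLs are all hypothetical.

```python
import random
from urllib.parse import urlsplit


def sample_urls(urls, path_prefix="/", size=500, seed=None):
    """Keep URLs whose path starts with `path_prefix`, then draw up to `size` of them.

    Hypothetical helper; simple random sampling is an assumption here.
    """
    candidates = [u for u in urls if urlsplit(u).path.startswith(path_prefix)]
    if len(candidates) <= size:
        # Fewer candidates than the sample size: investigate all of them.
        return candidates
    # Sample without replacement so no URL is investigated twice.
    return random.Random(seed).sample(candidates, size)


# Example: 3,000 blog URLs and 3,000 shop URLs, sampled down to 500 blog URLs.
all_urls = [f"https://example.com/blog/{i}" for i in range(3000)]
all_urls += [f"https://example.com/shop/{i}" for i in range(3000)]
picked = sample_urls(all_urls, path_prefix="/blog/", size=500, seed=1)
print(len(picked))
```

A sample of 500 stays well under the 2,000-URL daily limit per property while still giving a usable estimate of the index rate.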
Reference: 95% Wilson Confidence Interval
| Observed rate p̂ (rows) / sample size n (columns) | 2000 | 1000 | 500 | 300 | 100 |
|---|---|---|---|---|---|
| 99% | 0.9891 ± 0.004457 | 0.9881 ± 0.006434 | 0.9863 ± 0.009457 | 0.9838 ± 0.01279 | 0.9719 ± 0.02636 |
| 95% | 0.9491 ± 0.009581 | 0.9483 ± 0.01359 | 0.9466 ± 0.01934 | 0.9443 ± 0.02516 | 0.9334 ± 0.0451 |
| 70% | 0.6996 ± 0.02007 | 0.6992 ± 0.02836 | 0.6985 ± 0.04004 | 0.6975 ± 0.05159 | 0.6926 ± 0.08845 |
| 50% | 0.5 ± 0.02189 | 0.5 ± 0.03093 | 0.5 ± 0.04366 | 0.5 ± 0.05622 | 0.5 ± 0.09617 |
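Each cell in the table reads as the center of the interval ± its half-width. The values are consistent with the standard Wilson score interval at z = 1.96, which the following minimal sketch computes. The function name `wilson_interval` is hypothetical, and using z = 1.96 for the 95% level is an assumption about how the table was generated.

```python
import math


def wilson_interval(p_hat, n, z=1.96):
    """Return (center, half_width) of the Wilson score interval.

    p_hat: observed rate in the sample (for example, the index rate)
    n:     sample size
    z:     normal quantile; 1.96 corresponds to a 95% interval
    """
    denom = 1 + z * z / n
    center = (p_hat + z * z / (2 * n)) / denom
    half_width = (z / denom) * math.sqrt(
        p_hat * (1 - p_hat) / n + z * z / (4 * n * n)
    )
    return center, half_width


center, half_width = wilson_interval(0.99, 2000)
print(f"{center:.4g} ± {half_width:.4g}")
# prints: 0.9891 ± 0.004457
```

Reading across a row shows why a sample of around 500 is usually enough: at an observed rate of 95%, moving from n = 500 to n = 2000 only narrows the half-width from about 0.019 to about 0.010.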