Exhaustive Search (count:all)
Learn and understand more about Sourcegraph's Exhaustive Search and core functionality.
Exhaustive Search is a search that returns the complete set of every result matching an expression. Sourcegraph's search is optimized for fast, interactive searching, and as such, there are time and match limits that can stop a search before it is exhaustive. To remove the limits, add count:all
to your search query.
Exhaustive search is often needed when you want to solve security, compliance, code health, and other automated use cases based on the output of a search.
Slow queries
A query can be slow due to not using the index (commit/diff, non-indexed branches), poorly using the index (complicated regular expression), or having a large result set. This is a common concern with the exhaustive use cases, and we expect Sourcegraph to still return accurate results.
Once the query is complete, you will always get accurate statistics (e.g., number of matches) for a count: all
query. A repository will be reported directly if it is not searched (e.g., not cloned).
Over time, the priority of a query is reduced. This ensures that we can answer interactive queries quickly while allowing slow queries to run to completion.
Timeouts
There are two sources of timeouts in a count:all
query:
- A timeout in the HTTP load balancer in front of Sourcegraph (nginx/ELB/Cloudflare/etc). Your admin will likely need to increase timeouts for Sourcegraph endpoints. In particular, the
.api/search/stream
path. This uses SSE, so your reverse proxy may have specific support for these requests. - A maximum timeout enforced by Sourcegraph. Your admin may need to increase the site configuration value (default 60s) with the following setting:
"search.limits": {
"maxTimeoutSeconds": 60,
},
Large result sets
The Sourcegraph web app will only display up to 500 results (however will continue to display accurate statistics). If you need to process more than 500 results, please use the Sourcegraph CLI. For now, you will need to pass in the -stream
flag to get large result sets efficiently.
Limitations
Missing on Sourcegraph.com
This is a specific limitation for Sourcegraph.com and does not apply to customer instances. Sourcegraph.com does not keep a copy of all open source code it has ever discovered. However, it remembers that it discovered it. This leads to the repository showing up as "missing" when your repository filter includes it.
Non-indexed backends
A search is unindexed if you search non-indexed branches or use diff/commit search. Using a non-indexed backend and searching all code in a large instance can take 10min+. This is likely much higher than any configured timeouts. See the Timeouts section on configuring this use case.
Non-indexed backends do not use the same scheduling logic as indexed backends. This means concurrent slow, non-indexed searches will impact the resources of interactive searches.
An unindexed search can under-report result counts. This is due to limits on the number of results reported per file. See #18298.