Crawl, chunk, and vectorize any website so you can use its content with LLMs.
curl --request POST \
  --url https://api.embedding.io/v0/query \
  --header 'Authorization: Bearer V09XIFlPVSdSRSBBIEhBQ0tFUg==' \
  --json '{
    "collection": "col_lPMjKLBRLZ4qVe",
    "query": "Why should I avoid sunflower oil?"
  }'
Use our API or web interface to create a collection. A collection holds the pages or websites you want to use.
curl --request POST \
  --url https://api.embedding.io/v0/collections \
  --header 'Authorization: Bearer V09XIFlPVSdSRSBBIEhBQ0tFUg==' \
  --json '{
    "name": "Health Gurus"
  }'
Add web pages to your collection. We will handle the rest, including updates.
curl --request POST \
  --url https://api.embedding.io/v0/collections/col_lPMjKLBRLZ4qVe/websites \
  --header 'Authorization: Bearer V09XIFlPVSdSRSBBIEhBQ0tFUg==' \
  --json '{
    "domains": [
      "https://peterattiamd.com/",
      "https://www.foundmyfitness.com/"
    ]
  }'
Once your collection is ready, query it using our API. We keep the data updated for you.
curl --request POST \
  --url https://api.embedding.io/v0/query \
  --header 'Authorization: Bearer V09XIFlPVSdSRSBBIEhBQ0tFUg==' \
  --json '{
    "collection": "col_lPMjKLBRLZ4qVe",
    "query": "Why should I avoid sunflower oil?"
  }'
Try our public collections
Get started for free.
500 credits: $0/month
2,000 credits: $20/month
20,000 credits: $100/month
Everything in Startup, plus:
Frequently Asked Questions
A credit is the unit we use to track your usage of our services. Each page you add to your collection consumes one credit, and each time a page is updated it consumes one more.
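As a rough illustration of this credit arithmetic, the sketch below estimates how many credits a collection consumes over a period. The function name `credits_needed` and its arguments are hypothetical, and it assumes the worst case in which every page counts as updated on every refresh; actual billing may differ.

```shell
# Hypothetical helper, not part of the embedding.io API.
# Usage: credits_needed PAGES REFRESHES
#   PAGES      pages added to the collection (1 credit each)
#   REFRESHES  times each page is updated during the period (1 credit each)
credits_needed() (
  pages=$1
  refreshes=$2
  echo $(( pages + pages * refreshes ))
)

# 100 pages added, each updated 4 times during the period
credits_needed 100 4   # prints 500
```

The function body runs in a subshell (parentheses instead of braces), so its variables do not leak into the calling shell.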
You can embed any publicly available content on the internet, including websites, blogs, documentation, and more.
No, your website does not need to have a sitemap. Our system can crawl and extract content from any publicly accessible web pages.
Our crawler extracts the main content of each page and leaves out navigation, ads, and other irrelevant content. We use a combination of machine learning and heuristics to identify the main content.
How often content is updated depends on your plan: monthly on the Free plan, weekly on the Hobby plan, and daily on the Startup plan. You can also trigger an update manually.
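The update schedule can be expressed as a small lookup, which is handy when estimating usage. This is only a sketch: the function name `refreshes_per_month` and the lowercase plan keys are hypothetical, and the counts assume a 30-day month with roughly four weeks.

```shell
# Hypothetical helper: approximate automatic updates per month for a plan.
# Plan keys are illustrative lowercase versions of the plan names.
refreshes_per_month() (
  case "$1" in
    free)    echo 1  ;;   # monthly
    hobby)   echo 4  ;;   # weekly, roughly 4 per month
    startup) echo 30 ;;   # daily, assuming a 30-day month
    *)       echo "unknown plan: $1" >&2; exit 1 ;;
  esac
)

refreshes_per_month hobby   # prints 4
```

Manually triggered updates are not included in these counts.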
Yes, you can use our API to create, manage, and query your collections. Detailed API documentation is available to help you get started.
If you exceed your credit limit, you will need to upgrade to a higher plan to continue adding pages to your collections. You can also contact us for custom solutions.
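To see which credit allowance an estimated usage would fit into, a minimal sketch follows. The function name `smallest_allowance` is hypothetical; the allowances (500, 2,000, and 20,000 credits) are the ones listed in the pricing section, and anything above the largest is reported as `custom`, standing in for contacting us.

```shell
# Hypothetical helper: smallest listed monthly allowance covering NEEDED credits.
# Usage: smallest_allowance NEEDED
smallest_allowance() (
  needed=$1
  for allowance in 500 2000 20000; do
    if [ "$needed" -le "$allowance" ]; then
      echo "$allowance"
      exit 0   # leaves only the function's subshell
    fi
  done
  echo "custom"   # beyond the largest listed allowance
)

smallest_allowance 1800    # prints 2000
smallest_allowance 50000   # prints custom
```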
Yes, with the Enterprise plan, you can have custom crawlers tailored to your specific needs. Contact us for more details.
Yes, you can cancel your subscription at any time. Your account will remain active until the end of the current billing cycle.
Plug your content into an LLM in less than 5 minutes.