What is Image to Text?
A generic OCR task for solving text-based captchas. Send a base64-encoded image containing distorted text, and we return the recognized text using our ML OCR pipeline.
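As a minimal sketch of preparing the image for submission, the helper below base64-encodes raw image bytes with Python's standard library. The names `encode_image` and `encode_image_file` are illustrative assumptions, not part of the Capzy API. The sketch emits plain base64 with no data-URI prefix, since this page does not say whether a prefix is accepted.

```python
import base64
from pathlib import Path


def encode_image(data: bytes) -> str:
    """Return the base64 text form of raw image bytes (no data-URI prefix)."""
    return base64.b64encode(data).decode("ascii")


def encode_image_file(path: str) -> str:
    """Read an image file from disk and base64-encode its contents."""
    return encode_image(Path(path).read_bytes())
```

For example, `encode_image_file("captcha.png")` (a hypothetical file name) would produce a string suitable for the task's image field.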
How It Works
1. Send Task
POST your ImageToTextTask with the base64-encoded image to our API. We'll queue it instantly.
2. We Solve
Capzy's proprietary OCR transcribes the text from the image. Designed for accessibility tooling, archival document processing, and authorized automation workflows.
3. Get Text
Poll getTaskResult. When status is 'ready', the solution contains the recognized text.
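The polling step can be sketched with an explicit deadline so a stuck task does not loop forever. This is a hedged sketch, not Capzy's client: `wait_for_result` and its `fetch`, `timeout`, and `interval` parameters are hypothetical names, and the only statuses it relies on are 'ready' and 'failed'. The network call is passed in as a function so the loop can be exercised without touching the API.

```python
import time
from typing import Any, Callable, Dict


def wait_for_result(
    fetch: Callable[[], Dict[str, Any]],
    timeout: float = 30.0,
    interval: float = 1.0,
) -> Dict[str, Any]:
    """Call fetch() until status is 'ready' or 'failed', or the deadline passes."""
    deadline = time.monotonic() + timeout
    while True:
        result = fetch()
        status = result.get("status")
        if status == "ready":
            return result
        if status == "failed":
            raise RuntimeError(result.get("errorDescription") or "task failed")
        # Stop before sleeping if the next attempt would land past the deadline.
        if time.monotonic() + interval > deadline:
            raise TimeoutError(f"no result after {timeout} seconds")
        time.sleep(interval)
```

In real use, `fetch` would wrap the POST to getTaskResult for a given taskId and return the parsed JSON.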
Quick Integration
solve.py
import requests, time

API = "https://api.capzy.ai"
KEY = "capzy_your_key_here"

# Step 1: Create task
task = requests.post(f"{API}/createTask", json={
    "clientKey": KEY,
    "task": {
        "type": "ImageToTextTask",
        "body": "/9j/4AAQSkZJRgABAQ..."
    }
}).json()
task_id = task["taskId"]
print(f"Task created: {task_id}")

# Step 2: Poll for result
while True:
    result = requests.post(f"{API}/getTaskResult", json={
        "clientKey": KEY,
        "taskId": task_id
    }).json()
    if result["status"] == "ready":
        print("Solved!", result["solution"])
        break
    elif result["status"] == "failed":
        print("Failed:", result.get("errorDescription"))
        break
    time.sleep(1)

Task Parameters
field   type    required  description
type    string  yes       ImageToTextTask
body    string  yes       Base64-encoded image (PNG, JPG, GIF, BMP)
module  string  no        OCR module hint: 'common' (default), 'number' (digits only), 'alpha' (letters only)
Solution Response
field  type    description
text   string  The recognized text from the image
Features
Works on any text-based captcha
Capzy proprietary OCR models
Supports distorted/noisy text
< 1 second solve time
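The optional module hint from the task parameters can be checked before a request is sent. The sketch below builds the createTask payload. `build_task_payload` and `ALLOWED_MODULES` are hypothetical names, and rejecting unknown module values on the client side is an assumption of this sketch, not documented API behavior.

```python
from typing import Any, Dict, Optional

# Module hints listed in the task parameters: 'common', 'number', 'alpha'.
ALLOWED_MODULES = {"common", "number", "alpha"}


def build_task_payload(
    client_key: str,
    image_b64: str,
    module: Optional[str] = None,
) -> Dict[str, Any]:
    """Build the JSON body for createTask with an optional OCR module hint."""
    if not image_b64:
        raise ValueError("image_b64 must be a non-empty base64 string")
    task: Dict[str, Any] = {"type": "ImageToTextTask", "body": image_b64}
    if module is not None:
        if module not in ALLOWED_MODULES:
            raise ValueError(f"unknown module: {module!r}")
        task["module"] = module
    return {"clientKey": client_key, "task": task}
```

Omitting `module` leaves the field out of the payload entirely, so the service falls back to its default ('common').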
ProxyLess Task Type
ImageToTextTask
Start solving Image to Text.
$0.10 in free credits, no card required. ~250 free solves to test before you spend.