OCR & Text Extraction

Extracting text from images, scanned documents, and photos using optical character recognition.

Last updated: April 14, 2026
Tags: ocr, text extraction, images, scanning, photos, recognition

What is OCR?

OCR (Optical Character Recognition) is the technology that extracts readable text from images. Your AI employee uses OCR to read text from photos, screenshots, scanned documents, business cards, receipts, whiteboards, and any other image containing text. The extracted text can then be edited, searched, stored in your knowledge base, or used as input for other tasks.

Using OCR

To extract text from an image, send the image to your AI employee and ask it to read the text. You can send photos taken with your phone, screenshots from your computer, or scanned document images. The AI processes the image and returns the extracted text in a clean, editable format. It handles multiple languages, a range of fonts, and both printed and handwritten text, though accuracy on handwriting varies.

Extracting text from a photo

Read text from an image.

You say:
[Sends photo of a business card] Save this contact information.
Sarudo responds:
I've extracted the text from the business card: Jennifer Park, VP of Marketing, DataStream Inc. Email: jennifer.park@datastream.io. Phone: (650) 555-0173. Address: 500 Tech Drive, Suite 400, Palo Alto, CA 94301. I've created a new contact record in your CRM with these details. Would you like to add any notes about where you met Jennifer?
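To make the step from raw OCR text to a contact record more concrete, here is a minimal Python sketch. It is not Sarudo's implementation: the sample text, the `parse_contact` helper, and the regular expressions are all assumptions for illustration, and the phone pattern only covers common US formats.

```python
import re

# Hypothetical OCR output, modeled on the business card in the example above.
OCR_TEXT = """Jennifer Park
VP of Marketing
DataStream Inc.
jennifer.park@datastream.io
(650) 555-0173
500 Tech Drive, Suite 400, Palo Alto, CA 94301"""

# Simplified patterns (assumptions): real-world emails and phone numbers vary more.
EMAIL_RE = re.compile(r"[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}")
PHONE_RE = re.compile(r"\(?\d{3}\)?[\s.-]?\d{3}[\s.-]?\d{4}")


def parse_contact(text: str) -> dict:
    """Pull a few contact fields out of OCR text.

    Assumes the first non-empty line is the person's name, which is
    common on business cards but not guaranteed.
    """
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    email = EMAIL_RE.search(text)
    phone = PHONE_RE.search(text)
    return {
        "name": lines[0] if lines else None,
        "email": email.group(0) if email else None,
        "phone": phone.group(0) if phone else None,
    }


contact = parse_contact(OCR_TEXT)
print(contact["name"], contact["email"], contact["phone"])
```

Fields that cannot be found come back as `None`, which leaves room for a follow-up question instead of a guessed value.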

Scanned Documents

For scanned PDFs (PDFs that are essentially images of pages rather than digital text), the AI applies OCR to convert them into searchable, editable text. This is particularly useful for digitizing old contracts, letters, or any paper documents. After OCR processing, the text can be ingested into your knowledge base, making the information from physical documents as searchable as digital content.

ℹ️ For the best OCR results, scan documents at 300 DPI or higher. Ensure good lighting and a flat surface when photographing documents with your phone.
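The 300 DPI guideline translates directly into pixel dimensions: pixels = inches × DPI. The short Python sketch below works through that arithmetic. The function names are hypothetical, and the check assumes the image and the page share the same orientation.

```python
def required_pixels(width_in: float, height_in: float, dpi: int = 300) -> tuple[int, int]:
    """Pixel dimensions needed to cover a page of the given size at `dpi`."""
    return round(width_in * dpi), round(height_in * dpi)


def meets_dpi(px_w: int, px_h: int, width_in: float, height_in: float, dpi: int = 300) -> bool:
    """True if an image of px_w x px_h pixels covers the page at `dpi` or better."""
    need_w, need_h = required_pixels(width_in, height_in, dpi)
    return px_w >= need_w and px_h >= need_h


# US Letter is 8.5 x 11 inches, so a 300 DPI scan is 2550 x 3300 pixels.
print(required_pixels(8.5, 11))          # (2550, 3300)
print(meets_dpi(1700, 2200, 8.5, 11))    # False: this is only 200 DPI
```

The same calculation gives a rough way to judge a phone photo: if the page fills the frame, compare the photo's pixel dimensions with the required values.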

Accuracy & Tips

OCR accuracy depends on image quality, font clarity, and document layout. Clean, well-lit images of printed text typically achieve 98% or higher accuracy. Factors that can reduce accuracy include low resolution, poor lighting, unusual fonts, handwriting, and complex layouts with overlapping text and images. For critical documents, always check the extracted text against the original. The AI will flag sections where it had low confidence in the extraction.
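One common way to put a number on OCR accuracy is character-level accuracy: one minus the edit distance between the extracted text and a known-correct reference, divided by the reference length. The Python sketch below uses that definition as an assumption. The source does not say how the 98% figure is measured, and the function names are illustrative.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: minimum insertions, deletions, and substitutions."""
    prev = list(range(len(b) + 1))
    for i, ca in enumerate(a, start=1):
        curr = [i]
        for j, cb in enumerate(b, start=1):
            cost = 0 if ca == cb else 1
            curr.append(min(prev[j] + 1,          # delete ca
                            curr[j - 1] + 1,      # insert cb
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def char_accuracy(reference: str, extracted: str) -> float:
    """Character accuracy in [0, 1], relative to the reference text."""
    if not reference:
        return 1.0 if not extracted else 0.0
    return max(0.0, 1 - edit_distance(reference, extracted) / len(reference))


# One wrong character in a 50-character reference gives 98% accuracy.
reference = "a" * 50
extracted = "a" * 49 + "b"
print(f"{char_accuracy(reference, extracted):.2%}")
```

At 98% character accuracy, a page of 2,000 characters can still contain about 40 errors, which is why reviewing critical documents matters.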
