AWS AI Services & Amazon Bedrock

Pre-built AI APIs — no ML expertise needed. Use when you don't want to build/train your own model.

AI Services Overview

graph TD
    AI[AWS AI Services] --> Vision[Computer Vision]
    AI --> NLP[Natural Language]
    AI --> Speech[Speech]
    AI --> Forecast[Forecasting]
    AI --> Rec[Recommendations]
    AI --> Search[Search]
    AI --> Codegen[Code Generation]
    AI --> GenAI[Generative AI]

    Vision --> Rek[Rekognition<br/>Images + Video]
    NLP --> Comp[Comprehend<br/>Text Analysis]
    NLP --> Tx[Textract<br/>Document OCR]
    NLP --> Trans[Translate]
    NLP --> Kendra[Kendra<br/>Intelligent Search]
    Speech --> Transcribe[Transcribe<br/>Speech → Text]
    Speech --> Polly[Polly<br/>Text → Speech]
    Forecast --> Fore[Amazon Forecast]
    Rec --> Pers[Personalize]
    Search --> Kendra
    GenAI --> Bed[Bedrock<br/>Foundation Models]
    Codegen --> CW[CodeWhisperer]

    style AI fill:#dbeafe,stroke:#3b82f6
    style GenAI fill:#f3e8ff,stroke:#9333ea

Amazon Rekognition

Computer vision API — images and video.

Feature	What It Does
Object & Scene Detection	Detect objects (dog, car, tree), scenes (beach, office)
Facial Analysis	Detect faces, attributes (age range, emotion, gender, glasses)
Face Comparison	Compare two faces — similarity score
Face Search (IndexFaces)	Match face against a collection (identity verification)
Celebrity Recognition	Identify famous people
Text in Image (OCR)	Detect text in images (signs, license plates)
Content Moderation	Detect explicit, violent, or unsafe content
PPE Detection	Detect safety equipment (hard hats, masks)
Label Detection (Video)	Detect objects/activities in video, timestamp-based
Segment Detection (Video)	Find technical cues (black frames, credits, shots)

Key Use Cases:

User verification (match selfie to ID)
Content moderation (filter explicit images)
Smart media archive (tag and search videos)
Workplace safety monitoring (PPE compliance)

Exam tip: Rekognition Custom Labels — train Rekognition on your own images (few-shot) without deep ML knowledge.

Amazon Comprehend

NLP API — extract meaning from text.

Feature	What It Does
Sentiment Analysis	Positive / Negative / Neutral / Mixed
Entity Recognition	Persons, organisations, locations, dates, quantities
Key Phrase Extraction	Important phrases in text
Language Detection	Detect which language the text is in
Syntax Analysis	POS tagging (noun, verb, adjective)
Topic Modelling	Discover topics in document collection (LDA-based)
PII Detection	Find personally identifiable information (redact or classify)
Custom Classification	Train custom text classifier on your categories
Custom Entity Recognition	Train to recognise custom entities (product codes, internal terms)

Key Use Cases:

Customer feedback analysis (sentiment)
Legal document processing (entity extraction)
Medical records (Comprehend Medical)
GDPR compliance (PII detection + redaction)

Amazon Comprehend Medical

Specialised NLP for healthcare and medical text
Detects: diagnoses, medications, dosage, test results, anatomy
Protected Health Information (PHI) detection for HIPAA compliance

Amazon Textract

Extract text and structured data from documents (beyond basic OCR).

Feature	What It Does
Text Extraction	Detect and extract all printed/handwritten text
Forms Extraction	Extract key-value pairs from forms (Name: John)
Tables Extraction	Extract tabular data preserving structure
Query-based Extraction	Ask natural language questions ("What is the invoice total?")
Signature Detection	Detect presence of signatures
Expense Analysis	Specialised for receipts and invoices
Identity Documents	Driver's licences, passports (structured extraction)

Rekognition vs Textract:

Rekognition = text in images (signs, photos)

Textract = text in documents (PDFs, forms, tables)

Amazon Transcribe

Automatic Speech Recognition (ASR) — audio/video to text.

Feature	Notes
Streaming Transcription	Real-time, low-latency
Batch Transcription	Async, upload audio file to S3
Speaker Diarisation	Distinguish multiple speakers ("Speaker 1:", "Speaker 2:")
Custom Vocabulary	Add domain-specific words (medical, legal, brand names)
Custom Language Model	Fine-tune on domain-specific text corpus
PII Redaction	Auto-remove SSN, credit card numbers from transcript
Toxicity Detection	Flag harmful speech
Medical Transcription	Transcribe Medical — optimised for clinical vocabulary
Subtitles	Generate SRT/VTT files for video

Amazon Polly

Text-to-Speech (TTS) — text to lifelike audio.

Feature	Notes
Standard Voices	Concatenative TTS
Neural TTS (NTTS)	Higher quality, more natural
SSML Support	Markup to control emphasis, speed, pauses, pronunciation
Lexicons	Custom pronunciation rules (e.g. "AWS" → "Amazon Web Services")
Speech Marks	Metadata: word/sentence timings (for lip sync, karaoke)
Long-form TTS	Optimised for long documents

Amazon Translate

Neural machine translation — real-time or batch.

75+ languages
Formality control — formal vs informal tone
Custom terminology — preserve brand names, technical terms
Active Custom Translation — fine-tune with parallel data (source → target examples)
Real-time: API call with text
Batch: S3 input → S3 output

Amazon Lex

Build conversational chatbots — powers Alexa.

Component	Description
Intent	What the user wants to do (BookHotel, GetWeather)
Utterance	Example phrases that trigger the intent
Slot	Parameter to collect (city, date, room type)
Fulfillment	Lambda function called when intent is satisfied

Supports voice + text
Integrates with: Connect (call centres), Kendra (Q&A), Lambda (fulfilment)
V2 supports multi-turn conversations and streaming responses

Amazon Kendra

Intelligent enterprise search — NLP-powered.

Indexes documents from: S3, SharePoint, Salesforce, ServiceNow, RDS, websites
Returns specific answers (not just documents) — understands questions
Custom synonyms — map internal terminology
Access Control — respects document-level permissions
Incremental learning — improves from user feedback

Kendra vs OpenSearch:

Kendra = semantic NLP-based question answering, enterprise docs

OpenSearch = keyword/full-text search, log analysis, custom ranking

Amazon Forecast

Time series forecasting service — no ML expertise needed.

Feature	Notes
AutoML	Automatically selects best algorithm
Built-in algorithms	DeepAR+, NPTS, ETS, ARIMA, Prophet
Related time series	Include external factors (price, promotions, weather)
Item metadata	Add static attributes (category, brand)
Probabilistic forecasts	Returns P10, P50, P90 quantiles (confidence intervals)
What-if Analysis	Simulate scenarios (what if price drops 10%?)
Explainability	Impact of each feature on forecast

DeepAR (SageMaker) vs Amazon Forecast:

Forecast = fully managed, no code, AutoML

DeepAR = more control, custom training, SageMaker integration

Amazon Personalize

Real-time personalisation and recommendation service.

Recipe	Use Case
User-Personalisation	Personalised item recommendations per user
Personalised-Ranking	Re-rank a pre-filtered list for a user
Similar-Items	"Customers also viewed"
Trending-Now	Items gaining popularity
Next-Best-Action	Predict next best action/interaction

Concepts:

Dataset Group — container for datasets
Interactions dataset — required (user-item events: clicks, purchases, ratings)
Items dataset — optional (item metadata: category, price)
Users dataset — optional (user metadata: age, gender)
Solution — trained model (choose recipe)
Campaign — deployed solution for real-time recommendations

Real-time vs Batch:

Real-time: Campaign endpoint — call API per user
Batch: Batch Segment job — generate recs for all users at once → S3

Amazon Bedrock

Fully managed access to Foundation Models (FMs) via API.

Overview

No infrastructure to manage — call FM via API
Models from: Anthropic (Claude), Meta (Llama), Mistral, Amazon (Titan), Cohere, AI21
Private — your data and prompts are NOT used to train base models
Fine-tuned models stay in your account

Core Features

Feature	Description
Model Inference	Call any supported FM via unified API
Knowledge Bases	Managed RAG — connect FM to your documents
Agents	Autonomous agents that call APIs and take actions
Fine-tuning	Customise FM with your labelled data
Continued Pre-training	Adapt FM to your domain with unlabelled data
Model Evaluation	Compare models on custom prompts
Guardrails	Filter harmful content, PII, off-topic responses
Watermark Detection	Detect AI-generated text (Amazon Titan)

Knowledge Bases (RAG)

graph LR
    Docs[Your Documents<br/>S3] --> KB[Knowledge Base<br/>Bedrock]
    KB --> VS[Vector Store<br/>OpenSearch / Aurora / Pinecone]
    User[User Query] --> KB
    KB --> FM[Foundation Model]
    FM --> Answer[Grounded Answer]

Manages chunking, embedding, and vector store indexing automatically
Supported vector stores: OpenSearch Serverless, Aurora PostgreSQL, Pinecone, Redis Enterprise

Bedrock Agents

Autonomously break tasks into steps, call APIs, retrieve knowledge
Action Groups — define what APIs the agent can call (Lambda + OpenAPI spec)
Memory — agents can remember conversation context across sessions
Use for: customer service automation, data analysis, multi-step workflows

Guardrails

Content filtering — block hate speech, violence, sexual content
Denied topics — block specific subjects (e.g. competitor mentions)
Word filters — block specific words/phrases
PII redaction — detect and redact personal data
Grounding check — verify response is grounded in context (reduces hallucination)

Amazon Titan Models

Model	Type
Titan Text Premier/Express/Lite	Text generation
Titan Text Embeddings	Text embeddings for RAG
Titan Multimodal Embeddings	Image + text embeddings
Titan Image Generator	Text-to-image

AI Services — Exam Comparison Table

Scenario	Service
Detect objects in uploaded photos	Rekognition
Moderate user-generated content	Rekognition Content Moderation
Verify user identity with face match	Rekognition Face Comparison
Extract sentiment from customer reviews	Comprehend
Find medical terms in clinical notes	Comprehend Medical
Extract tables from PDF invoices	Textract
Transcribe call centre recordings	Transcribe
Build IVR chatbot for call centre	Lex + Connect
Enterprise document search ("What is our leave policy?")	Kendra
Demand forecasting with promotions data	Amazon Forecast
Personalised product recommendations	Personalize
Text-to-speech for e-learning	Polly
Translate product listings to 10 languages	Translate
Build RAG chatbot on company documents	Bedrock Knowledge Bases
Build autonomous AI agent for customer service	Bedrock Agents
Fine-tune Claude on proprietary data	Bedrock Fine-tuning
Prevent AI from responding off-topic	Bedrock Guardrails
Access GPT-like model without OpenAI	Bedrock (Claude, Titan, Llama)

AWS AI Services & Bedrock — AWS ML Associate