AWS AI Services & Bedrock — AWS ML Associate
AWS AI Services & Amazon Bedrock
Pre-built AI APIs — no ML expertise needed. Use when you don't want to build/train your own model.
AI Services Overview
graph TD
AI[AWS AI Services] --> Vision[Computer Vision]
AI --> NLP[Natural Language]
AI --> Speech[Speech]
AI --> Forecast[Forecasting]
AI --> Rec[Recommendations]
AI --> Search[Search]
AI --> Codegen[Code Generation]
AI --> GenAI[Generative AI]
Vision --> Rek[Rekognition<br/>Images + Video]
NLP --> Comp[Comprehend<br/>Text Analysis]
NLP --> Tx[Textract<br/>Document OCR]
NLP --> Trans[Translate]
NLP --> Kendra[Kendra<br/>Intelligent Search]
Speech --> Transcribe[Transcribe<br/>Speech → Text]
Speech --> Polly[Polly<br/>Text → Speech]
Forecast --> Fore[Amazon Forecast]
Rec --> Pers[Personalize]
Search --> Kendra
GenAI --> Bed[Bedrock<br/>Foundation Models]
Codegen --> CW[CodeWhisperer]
style AI fill:#dbeafe,stroke:#3b82f6
style GenAI fill:#f3e8ff,stroke:#9333ea
Amazon Rekognition
Computer vision API — images and video.
| Feature | What It Does |
|---|---|
| Object & Scene Detection | Detect objects (dog, car, tree), scenes (beach, office) |
| Facial Analysis | Detect faces, attributes (age range, emotion, gender, glasses) |
| Face Comparison | Compare two faces — similarity score |
| Face Search (IndexFaces) | Match face against a collection (identity verification) |
| Celebrity Recognition | Identify famous people |
| Text in Image (OCR) | Detect text in images (signs, license plates) |
| Content Moderation | Detect explicit, violent, or unsafe content |
| PPE Detection | Detect safety equipment (hard hats, masks) |
| Label Detection (Video) | Detect objects/activities in video, timestamp-based |
| Segment Detection (Video) | Find technical cues (black frames, credits, shots) |
Key Use Cases:
- User verification (match selfie to ID)
- Content moderation (filter explicit images)
- Smart media archive (tag and search videos)
- Workplace safety monitoring (PPE compliance)
Exam tip: Rekognition Custom Labels — train Rekognition on your own images (few-shot) without deep ML knowledge.
Amazon Comprehend
NLP API — extract meaning from text.
| Feature | What It Does |
|---|---|
| Sentiment Analysis | Positive / Negative / Neutral / Mixed |
| Entity Recognition | Persons, organisations, locations, dates, quantities |
| Key Phrase Extraction | Important phrases in text |
| Language Detection | Detect which language the text is in |
| Syntax Analysis | POS tagging (noun, verb, adjective) |
| Topic Modelling | Discover topics in document collection (LDA-based) |
| PII Detection | Find personally identifiable information (redact or classify) |
| Custom Classification | Train custom text classifier on your categories |
| Custom Entity Recognition | Train to recognise custom entities (product codes, internal terms) |
Key Use Cases:
- Customer feedback analysis (sentiment)
- Legal document processing (entity extraction)
- Medical records (Comprehend Medical)
- GDPR compliance (PII detection + redaction)
Amazon Comprehend Medical
- Specialised NLP for healthcare and medical text
- Detects: diagnoses, medications, dosage, test results, anatomy
- Protected Health Information (PHI) detection for HIPAA compliance
Amazon Textract
Extract text and structured data from documents (beyond basic OCR).
| Feature | What It Does |
|---|---|
| Text Extraction | Detect and extract all printed/handwritten text |
| Forms Extraction | Extract key-value pairs from forms (Name: John) |
| Tables Extraction | Extract tabular data preserving structure |
| Query-based Extraction | Ask natural language questions ("What is the invoice total?") |
| Signature Detection | Detect presence of signatures |
| Expense Analysis | Specialised for receipts and invoices |
| Identity Documents | Driver's licences, passports (structured extraction) |
Rekognition vs Textract:
- Rekognition = text in images (signs, photos)
- Textract = text in documents (PDFs, forms, tables)
Amazon Transcribe
Automatic Speech Recognition (ASR) — audio/video to text.
| Feature | Notes |
|---|---|
| Streaming Transcription | Real-time, low-latency |
| Batch Transcription | Async, upload audio file to S3 |
| Speaker Diarisation | Distinguish multiple speakers ("Speaker 1:", "Speaker 2:") |
| Custom Vocabulary | Add domain-specific words (medical, legal, brand names) |
| Custom Language Model | Fine-tune on domain-specific text corpus |
| PII Redaction | Auto-remove SSN, credit card numbers from transcript |
| Toxicity Detection | Flag harmful speech |
| Medical Transcription | Transcribe Medical — optimised for clinical vocabulary |
| Subtitles | Generate SRT/VTT files for video |
Amazon Polly
Text-to-Speech (TTS) — text to lifelike audio.
| Feature | Notes |
|---|---|
| Standard Voices | Concatenative TTS |
| Neural TTS (NTTS) | Higher quality, more natural |
| SSML Support | Markup to control emphasis, speed, pauses, pronunciation |
| Lexicons | Custom pronunciation rules (e.g. "AWS" → "Amazon Web Services") |
| Speech Marks | Metadata: word/sentence timings (for lip sync, karaoke) |
| Long-form TTS | Optimised for long documents |
Amazon Translate
Neural machine translation — real-time or batch.
- 75+ languages
- Formality control — formal vs informal tone
- Custom terminology — preserve brand names, technical terms
- Active Custom Translation — fine-tune with parallel data (source → target examples)
- Real-time: API call with text
- Batch: S3 input → S3 output
Amazon Lex
Build conversational chatbots — powers Alexa.
| Component | Description |
|---|---|
| Intent | What the user wants to do (BookHotel, GetWeather) |
| Utterance | Example phrases that trigger the intent |
| Slot | Parameter to collect (city, date, room type) |
| Fulfillment | Lambda function called when intent is satisfied |
- Supports voice + text
- Integrates with: Connect (call centres), Kendra (Q&A), Lambda (fulfilment)
- V2 supports multi-turn conversations and streaming responses
Amazon Kendra
Intelligent enterprise search — NLP-powered.
- Indexes documents from: S3, SharePoint, Salesforce, ServiceNow, RDS, websites
- Returns specific answers (not just documents) — understands questions
- Custom synonyms — map internal terminology
- Access Control — respects document-level permissions
- Incremental learning — improves from user feedback
Kendra vs OpenSearch:
- Kendra = semantic NLP-based question answering, enterprise docs
- OpenSearch = keyword/full-text search, log analysis, custom ranking
Amazon Forecast
Time series forecasting service — no ML expertise needed.
| Feature | Notes |
|---|---|
| AutoML | Automatically selects best algorithm |
| Built-in algorithms | DeepAR+, NPTS, ETS, ARIMA, Prophet |
| Related time series | Include external factors (price, promotions, weather) |
| Item metadata | Add static attributes (category, brand) |
| Probabilistic forecasts | Returns P10, P50, P90 quantiles (confidence intervals) |
| What-if Analysis | Simulate scenarios (what if price drops 10%?) |
| Explainability | Impact of each feature on forecast |
DeepAR (SageMaker) vs Amazon Forecast:
- Forecast = fully managed, no code, AutoML
- DeepAR = more control, custom training, SageMaker integration
Amazon Personalize
Real-time personalisation and recommendation service.
| Recipe | Use Case |
|---|---|
| User-Personalisation | Personalised item recommendations per user |
| Personalised-Ranking | Re-rank a pre-filtered list for a user |
| Similar-Items | "Customers also viewed" |
| Trending-Now | Items gaining popularity |
| Next-Best-Action | Predict next best action/interaction |
Concepts:
- Dataset Group — container for datasets
- Interactions dataset — required (user-item events: clicks, purchases, ratings)
- Items dataset — optional (item metadata: category, price)
- Users dataset — optional (user metadata: age, gender)
- Solution — trained model (choose recipe)
- Campaign — deployed solution for real-time recommendations
Real-time vs Batch:
- Real-time: Campaign endpoint — call API per user
- Batch: Batch Segment job — generate recs for all users at once → S3
Amazon Bedrock
Fully managed access to Foundation Models (FMs) via API.
Overview
- No infrastructure to manage — call FM via API
- Models from: Anthropic (Claude), Meta (Llama), Mistral, Amazon (Titan), Cohere, AI21
- Private — your data and prompts are NOT used to train base models
- Fine-tuned models stay in your account
Core Features
| Feature | Description |
|---|---|
| Model Inference | Call any supported FM via unified API |
| Knowledge Bases | Managed RAG — connect FM to your documents |
| Agents | Autonomous agents that call APIs and take actions |
| Fine-tuning | Customise FM with your labelled data |
| Continued Pre-training | Adapt FM to your domain with unlabelled data |
| Model Evaluation | Compare models on custom prompts |
| Guardrails | Filter harmful content, PII, off-topic responses |
| Watermark Detection | Detect AI-generated text (Amazon Titan) |
Knowledge Bases (RAG)
graph LR
Docs[Your Documents<br/>S3] --> KB[Knowledge Base<br/>Bedrock]
KB --> VS[Vector Store<br/>OpenSearch / Aurora / Pinecone]
User[User Query] --> KB
KB --> FM[Foundation Model]
FM --> Answer[Grounded Answer]
- Manages chunking, embedding, and vector store indexing automatically
- Supported vector stores: OpenSearch Serverless, Aurora PostgreSQL, Pinecone, Redis Enterprise
Bedrock Agents
- Autonomously break tasks into steps, call APIs, retrieve knowledge
- Action Groups — define what APIs the agent can call (Lambda + OpenAPI spec)
- Memory — agents can remember conversation context across sessions
- Use for: customer service automation, data analysis, multi-step workflows
Guardrails
- Content filtering — block hate speech, violence, sexual content
- Denied topics — block specific subjects (e.g. competitor mentions)
- Word filters — block specific words/phrases
- PII redaction — detect and redact personal data
- Grounding check — verify response is grounded in context (reduces hallucination)
Amazon Titan Models
| Model | Type |
|---|---|
| Titan Text Premier/Express/Lite | Text generation |
| Titan Text Embeddings | Text embeddings for RAG |
| Titan Multimodal Embeddings | Image + text embeddings |
| Titan Image Generator | Text-to-image |
AI Services — Exam Comparison Table
| Scenario | Service |
|---|---|
| Detect objects in uploaded photos | Rekognition |
| Moderate user-generated content | Rekognition Content Moderation |
| Verify user identity with face match | Rekognition Face Comparison |
| Extract sentiment from customer reviews | Comprehend |
| Find medical terms in clinical notes | Comprehend Medical |
| Extract tables from PDF invoices | Textract |
| Transcribe call centre recordings | Transcribe |
| Build IVR chatbot for call centre | Lex + Connect |
| Enterprise document search ("What is our leave policy?") | Kendra |
| Demand forecasting with promotions data | Amazon Forecast |
| Personalised product recommendations | Personalize |
| Text-to-speech for e-learning | Polly |
| Translate product listings to 10 languages | Translate |
| Build RAG chatbot on company documents | Bedrock Knowledge Bases |
| Build autonomous AI agent for customer service | Bedrock Agents |
| Fine-tune Claude on proprietary data | Bedrock Fine-tuning |
| Prevent AI from responding off-topic | Bedrock Guardrails |
| Access GPT-like model without OpenAI | Bedrock (Claude, Titan, Llama) |