open-menu
closeme
Home
Videos
Feeds
About
Contribute
github
linkedin
P-EAGLE: Faster LLM inference with Parallel Speculative Decoding in vLLM
calendar
Mar 13, 2026
·
aws.amazon.com/blogs/machine-learning
Improve operational visibility for inference workloads on Amazon Bedrock with new CloudWatch metrics for TTFT and Estimated Quota Consumption
calendar
Mar 12, 2026
·
aws.amazon.com/blogs/machine-learning
Secure AI agents with Policy in Amazon Bedrock AgentCore
calendar
Mar 12, 2026
·
aws.amazon.com/blogs/machine-learning
Multimodal embeddings at scale: AI data lake for media and entertainment workloads
calendar
Mar 12, 2026
·
aws.amazon.com/blogs/machine-learning
Fine-tuning NVIDIA Nemotron Speech ASR on Amazon EC2 for domain adaptation
calendar
Mar 12, 2026
·
aws.amazon.com/blogs/machine-learning
Operationalizing Agentic AI Part 1: A Stakeholder’s Guide
calendar
Mar 11, 2026
·
aws.amazon.com/blogs/machine-learning
Accelerate custom LLM deployment: Fine-tune with Oumi and deploy to Amazon Bedrock
calendar
Mar 10, 2026
·
aws.amazon.com/blogs/machine-learning
Run NVIDIA Nemotron 3 Nano as a fully managed serverless model on Amazon Bedrock
calendar
Mar 9, 2026
·
aws.amazon.com/blogs/machine-learning
Access Anthropic Claude models in India on Amazon Bedrock with Global cross-Region inference
calendar
Mar 9, 2026
·
aws.amazon.com/blogs/machine-learning