AWS News Blog
Category: Analytics
New Scheduling Options for AWS Data Pipeline
The AWS Data Pipeline lets you automate the movement and processing of any amount of data using data-driven workflows and built-in dependency checking. Today we are making the Data Pipeline more flexible and more useful with the addition of a new scheduling model that works at the level of an entire pipeline. This builds upon […]
Cross-Region Export and Import of DynamoDB Tables
Two of the most frequent feature requests for Amazon DynamoDB involve backup/restore and cross-Region data transfer. Today we are addressing both of these requests with the introduction of a pair of scalable tools (export and import) that you can use to move data between a DynamoDB table and an Amazon S3 bucket. The export and […]
AWS Data Pipeline Now Available in Four More Regions
As we often do, we launched AWS Data Pipeline in a single AWS Region (US East (Northern Virginia) to be precise). It is now available in the following Regions as well: US West (Oregon) EU (Ireland) Asia Pacific (Sydney) Asia Pacific (Tokyo) AWS Data Pipeline has always supported data flows between Regions. With today’s release, […]
Process Streaming Data with Kinesis and Elastic MapReduce
Regular readers of this blog already know that Amazon Kinesis is a fully managed service for real-time processing of streaming data at massive scale. As I noted last month when we introduced the Kinesis Storm Spout, Kinesis is but one component of a complete end-to-end streaming data application. In order to build such an application, […]
Marry Amazon Kinesis and Storm Using the New Kinesis Storm Spout
Amazon Kinesis is a fully managed service for real-time processing of streamed data at massive scale. When we launched Kinesis in November of 2013, we also introduced the Kinesis Client Library. You can use the client library to build applications that process streaming data. It will handle complex issues such as load-balancing of streaming data, […]
Amazon Kinesis – Public Beta Now Open
We announced a limited preview of Amazon Kinesis last month at AWS re:Invent. If you aren’t familiar with Kinesis, take a quick look at my recent post, Amazon Kinesis – Real-Time Processing of Streaming Big Data, to learn more. I am pleased to announce that Kinesis is now available in public beta form, and that […]
Amazon Kinesis – Real-Time Processing of Streaming Big Data
Imagine a situation where fresh data arrives in a continuous stream, 24 hours a day, 7 days a week. You need to capture the data, process it, and turn it into actionable conclusions as soon as possible, ideally within a matter of seconds. Perhaps the data rate or the compute power required for the analytics […]
AWS Data Pipeline Now Supports Amazon Redshift
AWS Data Pipeline (see my introductory blog post for more information) is a web service that helps you to integrate and process data across compute and storage services at specified intervals. You can transform and process data that is stored in the cloud or on-premises in a highly scalable fashion without having to worry about […]