AWS Architecture Blog
Category: AWS Lambda
Architecting a Data Lake for Higher Education Student Analytics
One of the keys to identifying timely and impactful actions is having enough raw material to work with. However, this up-to-date information typically lives in the databases that sit behind several different applications. One of the first steps to finding data-driven insights is gathering that information into a single store that an analyst can use […]
Unlocking Data from Existing Systems with a Serverless API Facade
In today’s modern world, it’s not enough to produce a good product; it’s critical that your products and services are well integrated into the surrounding business ecosystem. Companies lose market share when valuable data about their products or services are locked inside their systems. Business partners and internal teams use data from multiple sources to […]
Nielsen: Processing 55TB of Data Per Day with AWS Lambda
Earlier this year, I went into the studio with Opher Dubrovsky from Nielsen Marketing Cloud (a data management platform) to record an episode of This is My Architecture about Big Data architecture. In preparation for the recording and during my initial conversations with Opher, I realized that there is an amazing story here that can […]
Field Notes: Monitoring the Java Virtual Machine Garbage Collection on AWS Lambda
September 8, 2021: Amazon Elasticsearch Service has been renamed to Amazon OpenSearch Service. See details. When you want to optimize your Java application on AWS Lambda for performance and cost the general steps are: Build, measure, then optimize! To accomplish this, you need a solid monitoring mechanism. Amazon CloudWatch and AWS X-Ray are well suited […]
Fundbox: Simplifying Ways to Query and Analyze Data by Different Personas
Fundbox is a leading technology platform focused on disrupting the $21 trillion B2B commerce market by building the world’s first B2B payment and credit network. With Fundbox, sellers of all sizes can quickly increase average order volumes (AOV) and improve close rates by offering more competitive net terms and payment plans to their SMB buyers. […]
BBVA: Architecture for Large-Scale Macie Implementation
This post was co-written by Andrew Alaniz , Director of Technology Infrastructure, and Brady Pratt, Cloud Security Engineer, both at BBVA USA. Introduction Data Loss Prevention (DLP) is a common topic among companies that work with any type of sensitive data. One of the challenges is that many people either don’t fully understand what DLP […]
Liberty IT Adopts Serverless Best Practices Using AWS Cloud Development Kit
This post was co-written with Matthew Coulter, Lead Technical Architect of Global Risk at Liberty Mutual Liberty IT Solutions, part of Liberty Mutual Group, has been using AWS CloudFormation to deploy serverless applications on AWS for the last four years. These deployments typically involve defining, integrating, and monitoring services such as AWS Lambda, Amazon API […]
Serverless Architecture for a Web Scraping Solution
If you are interested in serverless architecture, you may have read many contradictory articles and wonder if serverless architectures are cost effective or expensive. I would like to clear the air around the issue of effectiveness through an analysis of a web scraping solution. The use case is fairly simple: at certain times during the […]
Building a Scalable Document Pre-Processing Pipeline
In a recent customer engagement, Quantiphi, Inc., a member of the Amazon Web Services Partner Network, built a solution capable of pre-processing tens of millions of PDF documents before sending them for inference by a machine learning (ML) model. While the customer’s use case—and hence the ML model—was very specific to their needs, the pipeline that does […]
Serving Billions of Ads in Just 100 ms Using Amazon Elasticache (Redis OSS)
This post was co-written with Lucas Ceballos, CTO of Smadex Introduction Showing ads may seem to be a simple task, but it’s not. Showing the right ad to the right user is an incredibly complex challenge that involves multiple disciplines such as artificial intelligence, data science, and software engineering. Doing it one million times per […]