如何用API Gateway和Lambda函数读取AWS Aurora只读副本数据并同步至DynamoDB？

阿华AIGC实验室

2026-5-27

Hey there! Since you're new to AWS Aurora, let's break this down into simple, actionable steps to get your data from the Aurora read replica to DynamoDB via an API request. Here's a beginner-friendly, step-by-step plan:

1. Connect to Your Aurora Read Replica

First, you need a safe way to query your read replica (using it avoids putting unnecessary load on your primary Aurora instance):

Option 1: AWS Lambda (Serverless & Low-Cost)
Lambda is perfect for beginners because you don't have to manage servers. You'll need to:
- Assign a Lambda IAM role with permission to access your Aurora cluster's VPC (since Aurora usually lives in private subnets).
- Add a MySQL driver to your Lambda function—like pymysql for Python or mysql2 for Node.js—to establish a connection.
- Write code to run SELECT queries against your read replica's endpoint.
Option 2: EC2 Instance (More Control)
If you need a long-running service or handle larger datasets, spin up an EC2 instance. Configure its security group to allow traffic to Aurora's default port (3306), then install your preferred MySQL client or application framework on the instance.

2. Build Your API Trigger

To trigger the data sync via an API call, use AWS API Gateway—it integrates seamlessly with Lambda or EC2:

Create either a REST API or HTTP API in API Gateway.
Link it to your Lambda function (or point it to your EC2 service's endpoint). When someone calls the API, it will trigger your backend to fetch data from Aurora.

3. Sync Data to DynamoDB

Once you've fetched data from Aurora, you need to write it to DynamoDB. Here's how to do it with Lambda (the most straightforward path for beginners):

Add DynamoDB write permissions to your Lambda IAM role (allow dynamodb:PutItem or dynamodb:BatchWriteItem).
Use the AWS SDK (like boto3 for Python or aws-sdk for Node.js) to map Aurora rows to DynamoDB items and write them.

Here's a quick Python example for Lambda:

import boto3
import pymysql
from botocore.exceptions import ClientError

def lambda_handler(event, context):
    # Fetch DB credentials from Secrets Manager (never hardcode!)
    secrets_manager = boto3.client('secretsmanager')
    try:
        secret = secrets_manager.get_secret_value(SecretId='your-aurora-secret-name')
        db_credentials = eval(secret['SecretString'])
    except ClientError as e:
        return {'statusCode': 500, 'body': f'Failed to get DB credentials: {str(e)}'}

    # Connect to Aurora Read Replica
    try:
        conn = pymysql.connect(
            host=db_credentials['host'],
            user=db_credentials['username'],
            password=db_credentials['password'],
            database=db_credentials['dbname']
        )
        cursor = conn.cursor()
        cursor.execute("SELECT id, name, value FROM your_target_table")
        rows = cursor.fetchall()
        conn.close()
    except Exception as e:
        return {'statusCode': 500, 'body': f'Failed to query Aurora: {str(e)}'}

    # Write to DynamoDB
    dynamodb = boto3.resource('dynamodb')
    table = dynamodb.Table('your-dynamodb-table-name')
    try:
        with table.batch_writer() as batch:
            for row in rows:
                batch.put_item(Item={
                    'id': row[0],
                    'name': row[1],
                    'value': row[2]
                    # Map other fields to match your DynamoDB schema
                })
    except ClientError as e:
        return {'statusCode': 500, 'body': f'Failed to write to DynamoDB: {str(e)}'}

    return {'statusCode': 200, 'body': f'Successfully synced {len(rows)} items to DynamoDB'}

4. Critical Best Practices for Beginners

Never Hardcode Credentials: Store Aurora passwords in AWS Secrets Manager and pull them in your code (like the example above). Use IAM roles for Lambda/EC2 instead of access keys.
Error Handling: Add try/catch blocks to handle connection failures, query errors, or DynamoDB write issues—this makes debugging way easier.
VPC Configuration: If using Lambda, make sure it's attached to the same VPC as your Aurora cluster, and that security groups allow inbound traffic from Lambda to Aurora.
Data Validation: Double-check that your Aurora data matches the DynamoDB table's schema (e.g., correct data types for keys) to avoid write failures.
Rate Limiting: Set up rate limits in API Gateway to prevent overwhelming your Aurora replica or Lambda function with too many requests.

5. Test the End-to-End Flow

Test Lambda First: Use the Lambda console's test feature to run your function and verify it fetches data from Aurora and writes to DynamoDB.
Test the API: Call your API Gateway endpoint (use tools like Postman or curl) and check if the data appears in DynamoDB.

If you hit snags with specific parts—like VPC permissions or schema mismatches—just ask for more details, and I can help you work through it!

内容的提问来源于stack exchange，提问作者ABCD