
Fechado
Publicado
Pago na entrega
Title: Data Ingestion from S3 to RDS with Fallback to AWS Glue using Dockerized Python Application Objective: Develop a Dockerized Python application that automates the process of: Reading data from an Amazon S3 bucket Pushing it to an RDS (MySQL-compatible) database Automatically falling back to AWS Glue if the RDS database is unavailable or the push operation fails This project helps you integrate multiple AWS services (S3, RDS, Glue), work with data pipelines, and use Docker for packaging and deployment. Project Tasks: 1. Python Script Development: Create a Python script that: o Reads a CSV file from an S3 bucket o Parses it using pandas o Tries to upload the data to an RDS MySQL-compatible database o If the RDS upload fails, falls back to: Creating a table in AWS Glue Data Catalog Registering the dataset location in S3 2. Dockerfile Creation: Build a Dockerfile that: o Uses a Python 3.9+ base image o Installs all necessary libraries (boto3, pandas, sqlalchemy, pymysql) o Copies the Python script into the container o Runs the script on container startup 3. Requirements File: Create a [login to view URL] with all necessary Python dependencies: 4. Image Build and Container Run: Build the Docker image: Run the container using environment variables for AWS credentials: Configuration Parameters (Replace with actual values): S3 Bucket Name CSV File Key RDS DB Endpoint, Username, Password, DB Name, Table Name Glue Database Name, Table Name, and S3 Location Deliverables: 1. Python script and Dockerfile stored in a GitHub repository 2. Working Docker image and container logs showing: o Successful push to RDS or fallback to Glue 3. Screenshot of: o Records inserted into RDS or o Table created in AWS Glue Catalog 4. Summary Report (1–2 pages) explaining: o Data flow (S3 → RDS → Glue fallback) o AWS services used o Docker setup o Challenges faced and how they were solved
ID do Projeto: 40316186
11 propostas
Projeto remoto
Ativo há 26 dias
Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos
11 freelancers estão ofertando em média ₹6.494 INR for esse trabalho

As a seasoned AWS-certified professional, I have successfully built numerous complex and scalable cloud infrastructures that tightly integrate multiple AWS services, such as S3, RDS and Glue. My proficiency in Python coupled with deep understanding of Docker made me the right fit for this project from day one. I bring over 5 years of hands-on experience in backend development, DevOps engineering and using Kubernetes for containerized applications which are all vital in executing this task. My clients value my proactive approach to problem-solving; whether it's automatically falling back to Glue or troubleshooting an AWS-related issue, I ensure your data pipelines stay intact. In addition to my technical skills, my certifications -- AWS Certified Solutions Architect – Associate and AWS Certified Practitioner -- validate my prowess in delivering efficient, secure and complaint solutions on the AWS ecosystem. Working with me means gaining access to a plethora of tools not limited to Docker, Kubernetes and Terraform plus integrating AI/ML solutions when implied for enhanced automations to achieve business outcomes. Lastly, being adept at maintaining security & compliance standards (HIPAA, PCI-DSS) will ensure your data safely traverses through S3, RDS and Glue. Dependability is key here - let me handle your Data Ingestion project!
₹12.500 INR em 7 dias
5,4
5,4

I am a Full-Stack Engineer with 4+ years of experience specializing in AWS (S3, RDS, Glue), Python, and Docker. I will develop a robust, containerized data pipeline that automates CSV ingestion from S3 to RDS with a seamless fallback to the AWS Glue Data Catalog. Technical Solution Python/Pandas Logic: A resilient script using boto3, sqlalchemy, and pymysql with structured error handling for immediate Glue fallback. Dockerization: A lightweight Python 3.9+ image optimized for environment-variable-driven deployments. AWS Integration: Automated table creation in Glue and schema mapping for RDS MySQL. Deliverables GitHub Repository: Clean, modular code including the Dockerfile and requirements.txt. Validation: Comprehensive container logs and screenshots of successful data insertion. Summary Report: A professional 2-page architectural overview and troubleshooting guide. I ensure an ATS-friendly, professional delivery that meets all your project milestones. Would you like me to include a sample .env template for your AWS credentials?
₹5.000 INR em 5 dias
4,8
4,8

Your fallback mechanism from RDS to Glue Data Catalog is smart - I'll build the Dockerized Python app with proper error handling using boto3, pandas, and SQLAlchemy. The container will attempt RDS insertion first, then automatically register the dataset in Glue if the database push fails. I built a similar data pipeline for my price aggregation engine that processes 800+ products daily with fallback mechanisms, plus automated systems that handle AWS integrations reliably. You can see my work at ffulb.com. Need your AWS credentials for S3, RDS, and Glue to develop and test properly - once I assess your setup I can deliver the complete Docker solution within a few days.
₹4.800 INR em 5 dias
3,1
3,1

I came across your project requirement for building a Dockerized Python application to automate data ingestion from S3 to RDS with a fallback mechanism using AWS Glue, and I’d be happy to assist. I have hands-on experience working with AWS services like S3, RDS, and Glue, along with building scalable data pipelines using Python, pandas, boto3, and SQLAlchemy. I can deliver a robust solution that: Efficiently reads and processes CSV data from S3 Inserts structured data into an RDS MySQL database Implements a reliable fallback mechanism using AWS Glue in case of failure Is fully containerized using Docker for easy deployment and portability Additionally, I will provide: Clean, well-documented code in a GitHub repository A production-ready Dockerfile and requirements setup Proper logging for success/failure tracking A concise summary report explaining the architecture and workflow I focus on writing maintainable, production-grade code with proper error handling and AWS best practices. I’d be glad to discuss your requirements in more detail and get started right away. Looking forward to working with you.
₹3.000 INR em 5 dias
0,0
0,0

I have hands-on experience in Python, AWS, and data engineering, making me well-suited for this project. I can develop a robust Dockerized Python application that reads data from S3, processes it using pandas, and efficiently loads it into an RDS MySQL database. I will implement a reliable fallback mechanism using AWS Glue to ensure data availability even in case of RDS failure. I will ensure clean, modular code, proper error handling, and secure use of environment variables for AWS credentials. The Docker container will be optimized for easy deployment and reproducibility. Additionally, I will provide complete documentation, logs, and a clear summary report explaining the data flow, architecture, and challenges. I can deliver this project within the given timeline with high quality and reliability.
₹7.000 INR em 10 dias
0,0
0,0

Hi, I have strong experience working with AWS services like S3, RDS, and AWS Glue, along with Dockerized Python applications, which makes me a great fit for this project. I understand your requirement clearly: - Read CSV data from S3 using Python (boto3 + pandas) - Insert data into RDS (MySQL) using SQLAlchemy/PyMySQL - Implement a fallback mechanism using AWS Glue Data Catalog if RDS fails - Containerize the complete solution using Docker My approach: 1. Develop a robust Python script with proper exception handling 2. Use environment variables for secure configuration (AWS creds, DB details) 3. Implement retry + fallback logic to Glue in case of DB failure 4. Create a clean and optimized Dockerfile 5. Provide logs and validation (RDS insert / Glue table creation) 6. Deliver complete GitHub repo + documentation + summary report I will ensure: ✔ Clean, modular, and production-ready code ✔ Proper error handling and logging ✔ Easy deployment using Docker ✔ Clear documentation and screenshots Quick questions: 1. Do you already have RDS and S3 configured? 2. Should Glue crawler also be included or only table creation? I can complete this within 5–7 days. Looking forward to working with you! Thanks, Ramana
₹7.000 INR em 5 dias
0,0
0,0

Hi , I can build a reliable Dockerized pipeline for S3 → RDS with a smart fallback to AWS Glue. I’ll ensure proper error handling, schema consistency, and clean modular code so it’s production-ready—not just a quick script. Happy to get started right away. Thanks, Sowmiya
₹8.000 INR em 7 dias
0,0
0,0

Pune, India
Membro desde mar. 21, 2026
₹100-400 INR / hora
$2-8 USD / hora
$30-250 CAD
$3000-5000 USD
$8-15 USD / hora
₹75000-150000 INR
£20-250 GBP
₹1500-12500 INR
$5000-10000 USD
$10-30 USD
$30-250 USD
$30-250 USD
₹12500-37500 INR
£5000-10000 GBP
₹1500-12500 INR
₹12500-37500 INR
$8-15 USD / hora
₹1500-12500 INR
₹12500-37500 INR
€250-750 EUR