
Fechado
Publicado
Pago na entrega
I need an Azure Databricks specialist to re-architect my small batch-processing cluster (1-10 nodes) so it reliably runs on spot instances and trims our monthly bill. The objective is simple: reduce overall operational costs without sacrificing job completion time or data integrity. What I expect from you: • Analyse the current workspace, jobs and autoscaling behaviour. • Design and apply an optimal spot-instance strategy (instance mix, bid price, fault-tolerant settings, pool configuration, autoscaling rules). • Implement job-level tweaks such as task parallelism, cluster reuse and termination timing to prevent idle burn. • Set up cost-tracking dashboards or alerts so I can see savings in real time. • Deliver a short hand-off guide that explains the new configuration and roll-back steps. Acceptance criteria • Batch jobs finish within their existing SLA. • Spot utilisation ≥ 80 % over a representative 7-day window. • Demonstrated cost reduction of at least 30 % compared with the current on-demand setup. Tools you are likely to touch: Azure Portal, Databricks CLI, Terraform or ARM templates (if you prefer IaC), and Azure Monitor/Cost Management. I’ll grant temporary contributor access once we agree on the approach, and I’m ready to start as soon as you are.
ID do Projeto: 40295906
7 propostas
Projeto remoto
Ativo há 2 dias
Defina seu orçamento e seu prazo
Seja pago pelo seu trabalho
Descreva sua proposta
É grátis para se inscrever e fazer ofertas em trabalhos
7 freelancers estão ofertando em média ₹1.043 INR for esse trabalho

Hello, I can optimize your Azure Databricks environment to reduce costs while maintaining SLA compliance for your batch-processing workloads. My approach will start with a thorough analysis of your current clusters, jobs, and autoscaling behavior to understand utilization patterns and potential inefficiencies. I will then design and implement a spot-instance strategy that maximizes availability and fault tolerance, tuning bid prices, instance types, pools, and autoscaling rules to target ≥80 % spot usage without risking job failures. Additionally, I will apply job-level optimizations such as parallel task execution, cluster reuse, and automated termination of idle nodes to minimize wasted compute. Cost-tracking dashboards and alerts will be configured in Azure Monitor so you can see real-time savings. Finally, I will deliver a concise hand-off guide covering the new configuration, monitoring approach, and rollback steps. The goal is a minimum 30 % cost reduction with batch jobs completing reliably within existing SLA windows. Regards, Mutahra
₹1.050 INR em 2 dias
0,0
0,0

Hello Sir, I can complete this data entry work with 100% accuracy. I have good experience in Excel, copy paste work and data entry. I will finish the work on time and provide accurate results. Please give me this opportunity. Thank you.
₹1.050 INR em 7 dias
0,0
0,0

Hi, I can help re-architect your Azure Databricks batch cluster to maximize spot instance usage and reduce costs while maintaining job reliability. I’ll review your current workspace configuration, autoscaling behavior, and job execution patterns to design an optimized spot strategy including instance mix, fault-tolerant settings, cluster pools, and autoscaling policies. I’ll also optimize job-level settings such as task parallelism, cluster reuse, and termination policies to avoid idle compute time. Additionally, I’ll set up cost monitoring dashboards and alerts using Azure Monitor/Cost Management so you can track savings in real time. You’ll receive an optimized configuration, documented rollback steps, and guidance to maintain ≥80% spot utilization while achieving significant cost reduction without impacting SLAs. Ready to start once access is provided. Thanks, Anand Shah
₹1.000 INR em 7 dias
0,0
0,0

Good day I trust you are well. Your goal to reduce operational costs by re-architecting the Databricks cluster to maximize spot instance use while maintaining job SLAs and data integrity is clear. I propose analyzing your current autoscaling and job configurations using the Databricks CLI and Terraform to design a fault-tolerant spot instance strategy with optimal instance mix and bid prices. Additionally, I'll implement job-level optimizations like cluster reuse and set up Azure Monitor dashboards to track cost savings in real time, aligning with your requirement for ≥80% spot utilization and 30% cost reduction. Tell me more about your project so I can help make it a success. Regards, Marissa
₹600 INR em 14 dias
0,0
0,0

I've done exactly this - migrated a client from on-demand to spot-optimized Databricks clusters and cut compute costs by 40%. For your 1-10 node batch cluster, here's my approach: Analysis (Day 1-2): - Audit current cluster configs, job run history, and idle time patterns - Identify peak usage windows and spot interruption risk tolerance per job Spot Strategy: - Configure instance pools with spot-first, on-demand fallback - Set up mixed instance types (e.g., Standard_DS3_v2 + DS4_v2) for better spot availability - Tune autoscaling with aggressive scale-down (60-90 sec idle) to kill idle burn Job-Level Tweaks: - Enable cluster reuse across jobs where possible - Adjust task parallelism based on actual data volume - Set termination policies to avoid zombie clusters Monitoring: - Azure Cost Management dashboard filtered by Databricks resource - Alerts for spot fallback events and cost anomalies Handoff: - Config documentation with rollback steps (I typically use Terraform so rollback is one command) I hold the DP-203 and Databricks Certified Data Engineer certifications. Happy to start with a quick screen-share to review your current setup. What's your current monthly Databricks spend, and which job types are the heaviest?
₹1.500 INR em 7 dias
0,0
0,0

Hello there, Databricks cluster can be tuned to maximize spot usage while keeping batch jobs stable and within SLA. The focus will be optimizing autoscaling rules, instance pools, and job-level execution so the cluster prefers spot nodes but gracefully handles preemptions without interrupting pipelines. Optimization Focus • Spot-first cluster configuration with balanced instance mix and fallback rules • Autoscaling and instance pool tuning to keep spot utilization high • Job-level improvements including task parallelism and cluster reuse • Idle shutdown and termination timing to remove unnecessary compute cost • Cost monitoring dashboards using Azure Monitor and Cost Management Is the current workload mostly scheduled batch jobs or trigger-based pipelines? Looking forward to making this smooth and efficient. Best Regards, Nikunj
₹1.500 INR em 7 dias
0,0
0,0

Hello, I can help you re-architect your Azure Databricks environment to significantly reduce costs while maintaining performance and reliability. With hands-on experience in ETL pipelines, Apache Spark (PySpark), and the Databricks platform, I understand how to optimize cluster configurations, job execution, and autoscaling behavior for cost efficiency. **My approach:** • Analyse your current cluster setup, job workflows, and autoscaling patterns to identify cost inefficiencies. • Design a spot-instance strategy with the right instance mix, autoscaling policies, and fault-tolerant configurations to maximize uptime and savings. • Optimize job execution using parallelism, cluster reuse, and efficient scheduling to eliminate idle resource usage. • Implement monitoring using Azure Monitor and cost tracking dashboards to provide real-time visibility into savings. • Provide a clear hand-off guide with documentation and rollback steps for safety. **Outcome you can expect:** • Reduced infrastructure costs (targeting 30%+ savings) • High spot instance utilization with stable job execution • No compromise on SLA, data integrity, or performance I focus on practical, scalable solutions and clean implementation. I am ready to start immediately and can adapt to your preferred tools like Terraform or CLI. Let’s discuss your current setup and identify quick wins. Best regards, Ayush
₹600 INR em 7 dias
0,0
0,0

New Delhi, India
Método de pagamento verificado
Membro desde mai. 1, 2016
₹100-400 INR / hora
₹400-750 INR / hora
₹400-750 INR / hora
₹600-1500 INR
₹600-1500 INR
₹500000-521000 INR
$2-8 USD / hora
₹37500-75000 INR
$2-8 USD / hora
$15-25 USD / hora
$250-750 USD
$30-250 USD
₹750-1250 INR / hora
₹37500-75000 INR
$10000-20000 USD
$30-250 USD
$30-250 USD
₹100-400 INR / hora
$2-8 USD / hora
$8-15 USD / hora
$2-8 USD / hora
₹1500-12500 INR
$15-25 USD / hora
₹1500-12500 INR
₹250000-500000 INR