Databricks Engineer

Job description

Hi,

Hope you are doing well. This is Ravali from My3 Tech. Please review the job description below and let me know if you are interested by replying to this email with your updated resume and a convenient time to discuss. You can also reach me at 605-674-1104 or by email at Ravali@My3Tech.com.

Title: Databricks Engineer

Location: Cincinnati, Ohio (Hybrid). Open to candidates who will relocate.

Client: ADM - Cincinnati

Duration: Contract

Must have: Data Management

• Data Pipeline Development: Design, develop, and maintain robust data pipelines using Databricks to process and transform large volumes of data.

• ETL Process Management: Implement ETL (Extract, Transform, Load) processes to integrate data from various sources into Databricks, ensuring data quality and integrity.

• Data Integration: Integrate Databricks with other data storage solutions and data lakes, ensuring seamless data flow and accessibility.

• Performance Optimization: Optimize data processing and query performance within Databricks to ensure efficient data retrieval and processing.

• Data Analysis and Visualization: Use Databricks to perform complex data analysis and create visualizations to support data-driven decision-making.

• Collaborate with Data Scientists and Analysts: Work closely with data scientists and analysts to understand their requirements and provide the necessary infrastructure and tools within Databricks.

• Security and Compliance: Ensure that data processing within Databricks complies with organizational security policies and industry regulations, implementing necessary security measures. This includes setting up encryption, managing network security configurations, and performing regular security audits.

• Monitoring and Troubleshooting: Monitor data pipelines and workflows for performance issues or errors, and troubleshoot any problems that arise to maintain smooth operations.

• Cluster Management: Manage the creation, configuration, and scaling of Databricks clusters to ensure optimal performance and cost-efficiency. This includes monitoring cluster usage and resource allocation and ensuring high availability.

• User and Access Management: Implement and manage user access controls, ensuring that only authorized personnel have access to Databricks resources. This involves setting up role-based access controls (RBAC), managing permissions, and integrating with identity management systems.

• Backup and Disaster Recovery: Develop and implement backup and disaster recovery plans for Databricks environments. Ensure that data and configurations are regularly backed up and that there are clear procedures in place for restoring services in the event of a failure.

Technical Skills

• Experience with Databricks: Hands-on experience with Databricks, including familiarity with its architecture, features, and services.

• Proficiency in Spark: Strong knowledge of Apache Spark, including Spark SQL, Spark Streaming, and Spark MLlib, as Databricks is built on Spark.

• Programming Languages: Proficiency in programming languages commonly used in data engineering such as Python, Scala, SQL, and Java.

• Data Warehousing and ETL: Experience with data warehousing concepts, ETL processes, and tools like Apache Airflow, Talend, or Informatica.

• Database Management: Knowledge of relational and NoSQL databases, data modeling, and query optimization.

• Big Data Technologies: Familiarity with big data technologies and ecosystems, including Hadoop, Hive, and Kafka.

Analytical and Problem-Solving Skills

• Data Analysis: Ability to perform complex data analysis and create data visualizations to support business decisions.

• Problem-Solving: Strong analytical and problem-solving skills to troubleshoot and resolve issues in data pipelines and workflows.

Soft Skills

• Communication Skills: Excellent verbal and written communication skills to collaborate with data scientists, analysts, and other stakeholders.

• Team Collaboration: Ability to work effectively in a team environment and contribute to cross-functional projects.

Certifications (Optional but Beneficial)

• Databricks Certifications: Certifications such as Databricks Certified Associate Developer for Apache Spark or Databricks Certified Professional Data Scientist can demonstrate expertise and enhance job prospects.

• Cloud Certifications: Certifications from cloud providers (e.g., Azure Certified Solutions Architect, Azure Data Engineer) can be advantageous.

Work Experience

• Relevant Experience: Prior experience working in data engineering, data analytics, or a related field is often required. This includes experience in building and maintaining data pipelines, ETL processes, and data integration.

Ravali
Technical Recruiter

Requirements

• Must have: Data Management

• Experience with Databricks: Hands-on experience with Databricks, including familiarity with its architecture, features, and services

• Proficiency in Spark: Strong knowledge of Apache Spark, including Spark SQL, Spark Streaming, and Spark MLlib, as Databricks is built on Spark

• Programming Languages: Proficiency in programming languages commonly used in data engineering such as Python, Scala, SQL, and Java

• Data Warehousing and ETL: Experience with data warehousing concepts, ETL processes, and tools like Apache Airflow, Talend, or Informatica

• Database Management: Knowledge of relational and NoSQL databases, data modeling, and query optimization

• Big Data Technologies: Familiarity with big data technologies and ecosystems, including Hadoop, Hive, and Kafka

• Analytical and Problem-Solving Skills

• Data Analysis: Ability to perform complex data analysis and create data visualizations to support business decisions

• Problem-Solving: Strong analytical and problem-solving skills to troubleshoot and resolve issues in data pipelines and workflows

• Communication Skills: Excellent verbal and written communication skills to collaborate with data scientists, analysts, and other stakeholders

• Team Collaboration: Ability to work effectively in a team environment and contribute to cross-functional projects

• Certifications (Optional but Beneficial)

• Databricks Certifications: Certifications such as Databricks Certified Associate Developer for Apache Spark or Databricks Certified Professional Data Scientist can demonstrate expertise and enhance job prospects

• Cloud Certifications: Certifications from cloud providers (e.g., Azure Certified Solutions Architect, Azure Data Engineer) can be advantageous

• Relevant Experience: Prior experience working in data engineering, data analytics, or a related field is often required, including experience in building and maintaining data pipelines, ETL processes, and data integration

Job Type

Full-time role

Skills required

Python, Java, NoSQL, Azure

Location

Cincinnati, Ohio

Salary

No salary information was found.

Date Posted

May 25, 2025

My3Tech Inc

My3Tech Inc is seeking a Databricks Engineer to design and maintain data pipelines using Databricks in a hybrid role based in Cincinnati, Ohio. The ideal candidate will have strong experience in data management, ETL processes, and Apache Spark.
