Job Title: Lead Agentic Data Engineer (Hybrid)
Location: Richmond, VA
Duration: 6+ Months
Pay Rate: $100/hr on C2C
Interview Process: Both Webcam & In Person

Job Description
Client is seeking a highly skilled Agentic Data Engineer to design, develop, and deploy data pipelines that leverage agentic AI to solve real-world problems. The ideal candidate will have experience designing data processes to support agentic systems, ensuring data quality, and facilitating interaction between agents and data.

Responsibilities:
• Design and develop data pipelines for agentic systems, with robust data flows that handle complex interactions between AI agents and data sources.
• Train and fine-tune large language models.
• Design and build the data architecture, including databases and data lakes, to support various data engineering tasks.
• Develop and manage Extract, Load, Transform (ELT) processes to ensure data is accurately and efficiently moved from source systems to analytical platforms used in data science.
• Implement data pipelines that facilitate feedback loops, allowing human input to improve system performance in human-in-the-loop systems.
• Work with vector databases to store and retrieve embeddings efficiently.
• Collaborate with data scientists and engineers to preprocess data, train models, and integrate AI into applications.
• Optimize data storage and retrieval for high performance.
• Apply statistical analysis to identify trends and patterns and to create data formats from multiple sources.

Qualifications:
• Strong data engineering fundamentals.
• Experience with big data frameworks such as Spark/Databricks.
• Experience training LLMs with structured and unstructured data sets.
• Understanding of graph databases.
• Experience with Azure Blob Storage, Azure Data Lakes, and Azure Databricks.
• Experience implementing Azure Machine Learning, Azure Computer Vision, Azure Video Indexer, Azure OpenAI models, Azure Media Services, and Azure AI Search.
• Ability to determine effective data partitioning criteria.
• Ability to implement partitioning schemes in Spark.
• Understanding of core machine learning concepts and algorithms.
• Familiarity with cloud computing.
• Strong programming skills in Python and experience with AI/ML frameworks.
• Proficiency in vector databases and embedding models for retrieval tasks.
• Expertise in integrating with AI agent frameworks.
• Experience with cloud AI services (Azure AI).
• Experience with GIS spatial data to create markers on maps (lat/long, nearest road topology, geolocation between datasets, correlation, etc.).
• Experience with Department of Transportation data domains, developing an AI composite agentic solution designed to identify and analyze data models, connect and correlate information to validate hypotheses, forecast, predict, and recommend potential strategies, and conduct what-if analysis.
• Bachelor's or master's degree in computer science, AI, data science, or a related field.

Top Skills & Years of Experience
Skill | Required/Desired | Amount of Experience
Understanding of big data technologies | Required | 1 year
Experience developing ETL and ELT pipelines | Required | 1 year
Experience with Spark, GraphDB, Azure Databricks | Required | 1 year
Expertise in data partitioning | Required | 1 year
Experience with data conflation | Required | 3 years
Experience developing Python scripts | Required | 3 years
Experience training LLMs with structured and unstructured data sets | Required | 2 years
Experience with GIS spatial data | Required | 3 years

Recruiter Details:
Name: Lokesh at gsksolution dot com
Contact: Eight three two - Nine nine zero - Two four two six

About Us:
GSK Solutions Inc. is a premier information technology services company dedicated to delivering exceptional consulting solutions and staff augmentation to our valued clients. With an unwavering commitment to quality, timeliness, and budgetary considerations, we consistently strive to exceed client expectations, building a strong reputation through our reliable execution. Our expertise spans commercial and custom product development, covering information security, software development, consulting, and IT audits. We excel in managing critical, time-sensitive projects for Fortune 500 clients nationwide, ensuring their success is always at the forefront of our mission.
Job Type
Full-time role
Skills required
Azure, Python
Location
Richmond, Virginia
Date Posted
March 18, 2025
GSK Solutions is seeking a Lead Agentic Data Engineer to design and develop data pipelines leveraging agentic AI in Richmond, VA. The role involves collaboration with data scientists and engineers to enhance AI systems through robust data architecture and processing.