Key Required Skills:
- Strong knowledge of AI, Machine Learning (ML), Large Language Models (LLMs), Python, and Natural Language Processing (NLP), plus experience in the clinical domain.

Position Description:
- Stay current on new methods in NLP, ML, and Generative AI.
- Understand real-world challenges and develop automated data solutions.
- Develop, test, and deploy new techniques for natural language understanding.
- Build and deploy ML and Generative AI approaches (such as LLMs) at scale.
- Train and optimize NLP/LLM models and create Python-based pipelines.
- Determine the nature of analytic problems, evaluate options, and recommend resolutions.
- Advise on the methods and data needed or available to evaluate intelligence or data problems.
- Collaborate with data collectors and analysts to identify and address gaps in complex monitoring problems.
- Provide accurate, timely, and sophisticated data analysis.

Basic Qualifications:
- Bachelor's degree in Statistics, Applied Mathematics, Computer Science, or Information Science, along with industry experience in NLP, data science, and AI/ML/LLM engineering.
- A minimum of 8 years of experience as a Data Scientist.
- Must be able to obtain and maintain a Public Trust (contract requirement).

Required Skills:
- Experience with Natural Language Processing (NLP), Generative AI, and Large Language Models (LLMs).
- Fluency in Python programming, version control and collaboration using Git, standard Python packages (e.g., Pandas, NumPy, Matplotlib), and ML frameworks.
- Knowledge of TensorFlow, PyTorch, Pandas, scikit-learn, and NLTK, with optional experience in Azure ML and Amazon Web Services EC2.
- Experience with scalable data engineering frameworks such as Apache Spark, orchestration frameworks such as Airflow, and semantic search.
- Expertise in conducting data analysis and applying advanced statistical concepts and ML methods to build, train, test, and evaluate supervised and unsupervised analytic models.
- Experience with ML model deployment and operations, including DevOps, MLOps, and LLMOps.
- Familiarity with NLP and Generative AI tooling, such as regular expressions, libraries (e.g., spaCy, LangChain), text annotation tools, and semantic frameworks.
- Ability to clean and process large amounts of real-world data.
- Experience retrieving and manipulating data from various sources, including DB2, Oracle, SQL Server, Hadoop, and flat files.
- Proficiency with SQL and database management systems (e.g., PostgreSQL, MySQL, SQLite).
- Excellent analytical skills to identify potential risks and propose effective solutions.
- Strong problem-solving skills and the ability to collaborate with cross-functional teams.
- Proven written and verbal communication skills, tailored to various audiences, including executive leadership.

Desired Skills:
- Prior experience working on applications in the clinical domain.
- Experience with federal or state government IT projects.
- Familiarity with distributed processing via the Hadoop ecosystem (e.g., Spark, Impala, Hive).
- Experience in an analytical research environment.
- Knowledge of parallel processing, such as GPU programming with CUDA.
- Familiarity with Mathematica.
- Experience using markup languages such as LaTeX and HTML.
- Experience with Natural Language Processing for anomaly detection.

Education:
- Bachelor's degree with 12+ years of experience.

Job Types: Full-time, Contract
Pay: Up to $89.00 per hour

Education:
• Bachelor's (Required)

Experience:
• AI: 9 years (Required)
• Machine Learning (ML): 9 years (Required)
• Large Language Models (LLM): 9 years (Required)
• Python: 9 years (Required)
• Natural Language Processing (NLP): 9 years (Required)
• Clinical domain: 9 years (Required)
• Data science: 9 years (Required)
• AI/ML/LLM engineering: 9 years (Required)
• Data Scientist: 8 years (Required)
• Generative AI: 9 years (Required)
• TensorFlow, PyTorch, Pandas, scikit-learn, and NLTK: 9 years (Required)

Security clearance:
• Confidential (Required)

Ability to Commute:
• Windsor Mill, MD 21244 (Required)

Work Location: In person
Job Type
Full-time role
Skills required
Python, Azure, PostgreSQL, MySQL
Location
Baltimore, Maryland
Salary
Up to $89.00 per hour
Date Posted
May 8, 2025
Vision is seeking an experienced AI/ML/LLM Data Scientist to develop and deploy advanced NLP solutions in the clinical domain. The role requires strong expertise in machine learning, Python, and large language models, with a focus on real-world data challenges.