The Senior Platform Data Engineer owns the roadmap, priorities, platform standards, and architecture reviews, and provides formal input on performance reviews. This position makes clinical data ready for AI at scale by owning the shared data products, retrieval infrastructure, and platform administration that the entire AI portfolio depends on. Areas of ownership: real-time data feeds; reusable clinical data models and feature pipelines; RAG retrieval infrastructure (ingestion, chunking, embeddings, vector DB, retrieval pipelines); and Databricks platform administration.
Job Duties
Streams data from Epic SDE, ADT feeds, lab results, and other clinical sources into Databricks for downstream model consumption.
Curates shared clinical feature tables (patient demographics, labs, vitals, diagnoses, utilization history, imaging metadata) in Databricks/Unity Catalog that multiple AI programs consume for model training, validation, and monitoring.
Owns RAG Infrastructure, the shared retrieval-augmented generation platform that agentic and generative AI programs use to ground LLM outputs in organizational knowledge.
Designs and operates document ingestion pipelines: normalizing clinical documents, policies, guidelines, and unstructured data sources into formats ready for embedding and retrieval.
Implements and optimizes chunking strategies tailored to healthcare content (e.g., preserving clinical note structure, section-aware chunking for guidelines and protocols).
Manages the embedding pipeline: selecting, tuning, and versioning embedding models (using domain-specific clinical models where they outperform general-purpose ones).
Administers the vector database: schema design, indexing, metadata management, access controls, and performance tuning.
Builds and maintains retrieval pipelines: hybrid search (vector + keyword/BM25), reranking, and relevance filtering to maximize retrieval precision for downstream agents and LLM applications.
Establishes data quality gates for RAG: automated profiling, completeness checks, and accuracy scoring before content enters the vector store.
Configures Databricks workspaces and governs Unity Catalog.
Sets cluster policies, manages compute, and monitors cost.
Manages users, groups, and access control.
Administers the Feature Store.
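To illustrate the section-aware chunking duty listed above, here is a minimal sketch in Python. It assumes that section headers in a clinical note are all-caps lines ending in a colon (for example "PLAN:"); that header pattern, the `Chunk` type, and the `chunk_note` function are hypothetical names chosen for illustration, not part of any existing pipeline.

```python
import re
from dataclasses import dataclass

# Assumption: a section header is an all-caps line ending in a colon, e.g. "PLAN:".
SECTION_HEADER = re.compile(r"^([A-Z][A-Z /&]+):\s*$")


@dataclass
class Chunk:
    section: str  # header the chunk belongs to
    text: str     # chunk body, lines joined with single spaces


def chunk_note(note: str, max_chars: int = 200) -> list[Chunk]:
    """Split a note into chunks that never cross a section boundary."""
    chunks: list[Chunk] = []
    section = "PREAMBLE"  # label for text that precedes the first header
    buffer: list[str] = []

    def flush() -> None:
        # Emit the buffered lines as one chunk tagged with the current section.
        if buffer:
            chunks.append(Chunk(section, " ".join(buffer)))
            buffer.clear()

    for raw in note.splitlines():
        line = raw.strip()
        if not line:
            continue
        match = SECTION_HEADER.match(line)
        if match:
            flush()  # close the previous section before switching
            section = match.group(1)
            continue
        # Start a new chunk when adding this line would exceed the size budget.
        if buffer and len(" ".join(buffer)) + 1 + len(line) > max_chars:
            flush()
        buffer.append(line)
    flush()
    return chunks
```

Keeping the section label on every chunk lets a retrieval pipeline filter or boost by section (for example, preferring "PLAN" text for treatment questions). A production version would likely budget in tokens, not characters.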
Work is typically performed in an office environment. Accountable for satisfying all job specific obligations and complying with all organization policies and procedures. The specific statements in this profile are not intended to be all-inclusive. They represent typical elements considered necessary to successfully perform the job.
*Relevant experience may be a combination of related work experience and degree obtained (Master's Degree = 2 years).
Position Details
Key Technologies:
Databricks (Delta Live Tables, Feature Store, PySpark, Unity Catalog)
Epic SDE / epic-ws for real-time clinical data extraction
Vector databases (Pinecone, Weaviate, Qdrant, or Databricks Vector Search)
Embedding models and pipelines (clinical domain-specific and general-purpose)
SQL, pandas
Streaming and batch ingestion patterns
CDIS Data Warehouse (source system for batch clinical data)
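The hybrid search named in the job duties (vector plus keyword/BM25) can be sketched with the pieces above in miniature. The example below is an assumption-laden illustration: it uses plain cosine similarity in place of a vector database, a simple term-count score as a stand-in for BM25, and reciprocal rank fusion (one common way to merge ranked lists) to combine them. All function names are hypothetical.

```python
import math
from collections import defaultdict


def cosine(a: list[float], b: list[float]) -> float:
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def rank_by_vector(query_vec: list[float], doc_vecs: dict[str, list[float]]) -> list[str]:
    """Order document ids by similarity to the query embedding (best first)."""
    return sorted(doc_vecs, key=lambda d: cosine(query_vec, doc_vecs[d]), reverse=True)


def rank_by_keyword(query: str, docs: dict[str, str]) -> list[str]:
    """Order document ids by query-term count (a simplified stand-in for BM25)."""
    terms = query.lower().split()

    def score(doc_id: str) -> int:
        words = docs[doc_id].lower().split()
        return sum(words.count(t) for t in terms)

    return sorted(docs, key=score, reverse=True)


def reciprocal_rank_fusion(rankings: list[list[str]], k: int = 60) -> list[str]:
    """Merge ranked lists: each list adds 1 / (k + rank) to a document's score."""
    scores: dict[str, float] = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    return sorted(scores, key=scores.get, reverse=True)
```

A caller would pass `[rank_by_vector(...), rank_by_keyword(...)]` to `reciprocal_rank_fusion` and hand the top results to a reranker. Rank fusion avoids having to put cosine scores and keyword scores on a common scale, which is why it is used here instead of a weighted sum.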
Required Skills & Qualifications:
5+ years in data engineering, with strong experience building both batch and streaming data pipelines
Expert-level Databricks skills: Delta Live Tables, PySpark, Unity Catalog, Feature Store
Hands-on experience with real-time data ingestion (Kafka, Spark Structured Streaming, or comparable frameworks)
Strong SQL and Python (pandas, PySpark) skills for data transformation and feature engineering
Familiarity with clinical data models and healthcare data sources (EHR extracts, ADT feeds, lab results, claims data) strongly preferred
Experience with Epic data extraction methods (SDE, FHIR, epic-ws) a significant plus
Understanding of data governance principles: lineage, quality monitoring, access controls
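As an illustration of the quality-monitoring expectation, and of the RAG data quality gates described in the job duties, a completeness check that runs before content enters the vector store might look like the sketch below. The required field names, thresholds, and the `quality_gate` function are assumptions made for the example; accuracy scoring is out of scope here.

```python
from dataclasses import dataclass, field

# Assumption: every document record must carry these fields before embedding.
REQUIRED_FIELDS = ("doc_id", "source", "text")


@dataclass
class GateResult:
    passed: bool
    completeness: float  # fraction of required fields that are non-empty
    reasons: list[str] = field(default_factory=list)


def quality_gate(record: dict, min_chars: int = 50,
                 min_completeness: float = 1.0) -> GateResult:
    """Check one document record; a record passes only with no failure reasons."""
    present = [f for f in REQUIRED_FIELDS if str(record.get(f) or "").strip()]
    completeness = len(present) / len(REQUIRED_FIELDS)
    reasons: list[str] = []

    if completeness < min_completeness:
        missing = [f for f in REQUIRED_FIELDS if f not in present]
        reasons.append(f"missing fields: {', '.join(missing)}")

    # Very short text tends to produce low-value embeddings, so reject it.
    text = str(record.get("text") or "")
    if len(text.strip()) < min_chars:
        reasons.append(f"text shorter than {min_chars} characters")

    return GateResult(passed=not reasons, completeness=completeness, reasons=reasons)
```

Returning the reasons alongside the pass/fail flag makes rejected documents easy to route to a review queue and gives a per-source completeness metric to monitor over time.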
Education
Bachelor's Degree-Related Field of Study (Required), Master's Degree-Related Field of Study (Preferred)
Experience
Minimum of 5 years-Relevant experience* (Required)
Certification(s) and License(s)
OUR PURPOSE & VALUES: Everything we do is about caring for our patients, our members, our students, our Geisinger family and our communities. KINDNESS: We strive to treat everyone as we would hope to be treated ourselves. EXCELLENCE: We treasure colleagues who humbly strive for excellence. LEARNING: We share our knowledge with the best and brightest to better prepare the caregivers for tomorrow. INNOVATION: We constantly seek new and better ways to care for our patients, our members, our community, and the nation. SAFETY: We provide a safe environment for our patients and members and the Geisinger family.
We offer healthcare benefits for full-time and part-time positions from day one, including vision, dental and domestic partner coverage. Perhaps just as important, from senior management on down, we encourage an atmosphere of collaboration, cooperation and collegiality. We know that a diverse workforce with unique experiences and backgrounds makes our team stronger. Our patients, members and community come from a wide variety of backgrounds, and it takes a diverse workforce to make better health easier for all. We are proud to be an affirmative action, equal opportunity employer, and all qualified applicants will receive consideration for employment regardless of race, color, religion, sex, sexual orientation, gender identity, national origin, disability or status as a protected veteran.
We are an Affirmative Action, Equal Opportunity Employer. Women and Minorities are Encouraged to Apply. All qualified applicants will receive consideration for employment and will not be discriminated against on the basis of disability or their protected veteran status.
At Geisinger, our innovative ideas are inspired by the communities we serve – like our Fresh Food Farmacy, a program that delivers life-saving healthy alternatives to patients with diabetes. With additional tools like our MyCode Community Health Initiative, one of the first health system genome sequencing programs, and our new asthma app suite that we developed in partnership with AstraZeneca, it’s no wonder we’re ranked one of the Top 5 Most Innovative Healthcare Systems by Becker's Hospital Review. We work continually toward improvement in a culture where everyone has a voice, and we firmly believe that better begins with all of us.
Founded more than 100 years ago, Geisinger serves more than three million residents throughout central, south-central and northeastern Pennsylvania and southern New Jersey. Our physician-led system is comprised of 30,000 employees, including 1,600 employed physicians, and consists of 1...3 hospital campuses, the Geisinger Health Plan, Geisinger Commonwealth School of Medicine and two research centers. What you do at Geisinger shapes the future of health and improves lives – for our patients, communities, and you.