- Career Center Home
- Search Jobs
- Postdoctoral Researcher in AI-Driven Data Curation and Data Integration
Results
Job Details
Explore Location
University of Pennsylvania
Philadelphia, Pennsylvania, United States
(on-site)
Posted
11 days ago
University of Pennsylvania
Philadelphia, Pennsylvania, United States
(on-site)
Job Type
Full-Time
Job Function
Research
Postdoctoral Researcher in AI-Driven Data Curation and Data Integration
The insights provided are generated by AI and may contain inaccuracies. Please independently verify any critical information before relying on it.
Postdoctoral Researcher in AI-Driven Data Curation and Data Integration
The insights provided are generated by AI and may contain inaccuracies. Please independently verify any critical information before relying on it.
Description

Postdoctoral Researcher in AI-Driven Data Curation and Data Integration
Location: University of Pennsylvania - Perelman School of Medicine
Open Date: Jun 02, 2026
Deadline:
Faculty Mentor: Joost Wagenaar
Department: Informatics
Number of Positions: 2
Open to applications from US Citizens and foreign nationals.
The Wagenaar Lab is seeking a highly motivated Postdoctoral Researcher to conduct research at the intersection of artificial intelligence, common data elements (CDEs), and large-scale biomedical datasets. The Wagenaar Lab is jointly based in the Institute for Biomedical Informatics and the Department of Biostatistics, Epidemiology, and Informatics at the University of Pennsylvania, and leads the academic development of the Pennsieve scientific data platform. The lab's mission is to create scalable, sustainable infrastructure that enables data integration, reuse, and discovery across clinical and scientific research domains.
This postdoctoral position will focus on developing AI-enabled methods to automate and augment data curation, with an emphasis on leveraging CDEs to improve the usability, interoperability, and scientific value of public datasets. The successful candidate will work across disease areas-including Epilepsy, Immune Health, and programs within the NIH HEAL Initiative-to design approaches that harmonize heterogeneous datasets, enrich metadata, and support scalable data exploration.
The Postdoctoral Researcher will work closely with the Pennsieve development team and a broad network of scientific collaborators to translate industry best practices in data engineering and AI into the academic research ecosystem. A central goal of this role is to move beyond manual, project-specific curation toward reproducible, automated, and extensible curation workflows that can be applied across datasets, programs, and institutions.
In addition to platform and method development, the Postdoctoral Researcher is expected to contribute to peer-reviewed publications, open-source software, and community-facing resources that advance AI-enabled data stewardship and reuse.
Responsibilities
- Develop AI-based methods and deploy them at scale to automate and augment data curation using Common Data Elements
- Design workflows to harmonize, validate, and enrich public datasets across Epilepsy, Immune Health, and NIH HEAL programs
- Develop novel mechanisms to interrogate, visualize and interact with complex scientific datasets and increase the value of these datasets for the scientific community.
- Integrate curation methods into scalable, cloud-based scientific data platforms
- Collaborate with the Pennsieve development team and scientific partners to align methods with real research workflows
- Evaluate and validate curation approaches using large, heterogeneous public datasets
- Prepare manuscripts, technical documentation, and presentations describing methods and outcomes.
Qualifications
Qualifications
- Ph.D. (preferred) or Master's degree in Biomedical Informatics, Computer Science, Data Science, Bioinformatics, or a related field
- Experience with machine learning, natural language processing, or AI applied to structured and unstructured data
- Familiarity with Common Data Elements, data standards, or ontology-based data representation (preferred)
- Strong programming skills in Docker, Python, Go, Java, or related languages
- Experience working with large-scale biomedical or clinical datasets
- Experience with cloud-based data processing and scalable analytics environments (AWS preferred)
- Strong written and verbal communication skills and an interest in interdisciplinary collaboration
Application Instructions
Please include a cover letter and a CV for consideration.
To apply, visit https://apply.interfolio.com/187547
Equal Employment Opportunity Statement
The University of Pennsylvania is an equal opportunity employer. Candidates are considered for employment without regard to race, color, sex, sexual orientation, religion, creed, national origin (including shared ancestry or ethnic characteristics), citizenship status, age, disability, veteran status or any class protected under applicable federal, state, or local law.
Copyright 2025 Jobelephant.com Inc. All rights reserved.
Posted by the FREE value-added recruitment advertising agency jeid-c846cb503bf30841911157297cf3ee2d
Job ID: 84560322
Jobs You May Like
Median Salary
Net Salary per month
$4,505
Cost of Living Index
78/100
78
Median Apartment Rent in City Center
(1-3 Bedroom)
$1,927
-
$3,705
$2,816
Safety Index
34/100
34
Utilities
Basic
(Electricity, heating, cooling, water, garbage for 915 sq ft apartment)
$155
-
$339
$239
High-Speed Internet
$50
-
$100
$75
Transportation
Gasoline
(1 gallon)
$3.37
Taxi Ride
(1 mile)
$2.51
Data is collected and updated regularly using reputable sources, including corporate websites and governmental reporting institutions.
Loading...