The Institutional Data Initiative (IDI) is a new research center working to advance society's relationship with knowledge by expanding access to, and deepening our understanding of, the data that underpins AI. By collaborating with library, government, and academic institutions to publish their knowledge collections as AI training sets, IDI seeks to 1) empower those institutions and the cultures they represent, 2) build a foundational pipeline for academic inquiry of AI, and 3) advance the state of the art for all builders of AI systems.
IDI's work spans the AI data ecosystem, from digitization, data structuring, and metadata synthesis, to safety and security analysis, all the way through to benchmarking and the development of ethical and governance frameworks. Institutional collaboration forms the gateway to this work and IDI places a particular emphasis on opportunities with institutions that expand the cultural breadth of knowledge represented in the building blocks of AI.
At its core, IDI is a data practice around which other interdisciplinary work is convened. While theory and analysis are critical components of IDI's work, our impact is a direct factor of our ability to ship novel data. As such, IDI's workflows resemble those of a product studio. Our projects are time-bound and scopes are driven by ambition within time constraints. Our team structure is relatively flat and each member is expected to bring vision for their work and drive it through to completion. We prioritize interdisciplinary collaboration with academic contributors, both internal and external, as essential work that prevents the commodification of the data we help to publish.
The technical capabilities of our Principal Engineers define the depth of analysis and inquiry at IDI while developing and deploying repeatable methods and pipelines. The person in this role will have an ability to think creatively about extracting and manipulating data to unlock knowledge collections that have been stubbornly inaccessible, sometimes for centuries. Their understanding of machine learning and AI fundamentals will help identify areas of high impact and utilize models to facilitate this work. Each Principal Engineer must bring a unique set of skills and approaches to a team of engineers whose distinct capabilities complement the whole. This team works together to build an action plan for each corpus that takes it from uncharted territory to a well-defined map that others can traverse.
Beyond data, Principal Engineers also contribute to the building of community around IDI's work to enable outside collaborators (fellow technologists, academics, students, cultural stakeholders) to expand our capabilities, capacity, and perspectives. IDI operates within Harvard and alongside the Library Innovation Lab, the Berkman Klein Center, and the Applied Social Media Lab; engaging these communities, among others, is critical to delivering on our mission.
As a Principal Engineer, you will:
Develop, refine and evaluate methods for analyzing and augmenting corpora.
Research, train and evaluate machine learning models.
Research, adjust and evaluate natural language processing techniques.
Write and contribute to open-source software.
Conduct an ongoing technology and research watch in areas pertinent to IDI's focus, including: generative AI, natural language processing, digital preservation and open knowledge.
Provide technical leadership and guidance to both your team members and your project peers.
Help build and lead development of multiple discrete projects at once.
Draft technical and scientific communications outlining datasets and novel methodologies developed in the course of your work.
Be a technological ambassador for the research center. Help build and engage with the broader academic and AI communities including the Harvard student population.
Engage with partners to both share our work and explore new opportunities.
Basic Qualifications
Minimum of seven years' post-secondary education or relevant work experience
Additional Qualifications and Skills
We are looking for people who have:
A desire to create public-interest impact on AI.
Deep understanding and experience designing, implementing, testing, and documenting data workflows.
Advanced working knowledge of machine learning and "AI" systems including local and open toolchains in addition to commercial offerings.
OCR and/or computer vision experience.
Strong grasp of at least one general purpose development technology (Python, JavaScript, Lisp, ...) and of everyday development tools (IDEs, Git, dependency management, Linux & SSH, CI/CD, ...).
Experience drafting technical or scientific communications to document and disseminate their work.
A track record of shipping complex projects, especially working independently or within early-stage companies and organizations.
Candidates need not have all of these qualifications to be promising, but all promising candidates will have a strong track record of shipping.
Working Conditions
Travel is required for quarterly on-site meetings. Occasional travel for conferences and events as needed.
Additional Information
This is a two-year term appointment with potential for renewal subject to funding and departmental need.
Given the multidisciplinary nature of our work, we encourage a short cover letter to explain how your career trajectory and interests align with our work and mission.
We regret that Harvard Law School is unable to provide visa sponsorship for staff positions.
All offers to be made by HLS Human Resources
Benefits
We invite you to visit Harvard's Total Rewards website (https://hr.harvard.edu/totalrewards) to learn more about our outstanding benefits package, which may include:
Paid Time Off: 3-4 weeks of accrued vacation time per year (3 weeks for support staff and 4 weeks for administrative/professional staff), 12 accrued sick days per year, 12.5 holidays plus a Winter Recess in December/January, 3 personal days per year (prorated based on date of hire), and up to 12 weeks of paid leave for new parents who are primary caregivers.
Health and Welfare: Comprehensive medical, dental, and vision benefits, disability and life insurance programs, along with voluntary benefits. Most coverage begins as of your start date.
Work/Life and Wellness: Child and elder/adult care resources including on campus childcare centers, Employee Assistance Program, and wellness programs related to stress management, nutrition, meditation, and more.
Retirement: University-funded retirement plan with contributions from 5% to 15% of eligible compensation, based on age and earnings with full vesting after 3 years of service.
Tuition Assistance Program: Competitive program including $40 per class at the Harvard Extension School and reduced tuition through other participating Harvard graduate schools.
Tuition Reimbursement: Program that provides 75% to 90% reimbursement up to $5,250 per calendar year for eligible courses taken at other accredited institutions.
Professional Development: Programs and classes at little or no cost, including through the Harvard Center for Workplace Development and LinkedIn Learning.
Commuting and Transportation: Various commuter options handled through the Parking Office, including discounted parking, half-priced public transportation passes and pre-tax transit passes, biking benefits, and more.
Harvard Facilities Access, Discounts and Perks: Access to Harvard athletic and fitness facilities, libraries, campus events, credit union, and more, as well as discounts to various types of services (legal, financial, etc.) and cultural and leisure activities throughout metro-Boston.
Work Format
Hybrid (partially on-site, partially remote)
Commitment to Equity, Diversity, Inclusion, and Belonging
Harvard University views equity, diversity, inclusion, and belonging as the pathway to achieving inclusive excellence and fostering a campus culture where everyone can thrive. We strive to create a community that draws upon the widest possible pool of talent to unify excellence and diversity while fully embracing individuals from varied backgrounds, cultures, races, identities, life experiences, perspectives, beliefs, and values.
EEO Statement
We are an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, gender identity, sexual orientation, pregnancy and pregnancy-related conditions, or any other characteristic protected by law.
Harvard University is devoted to excellence in teaching, learning, and research, and to developing leaders in many disciplines who make a difference globally. The University, which is based in Cambridge and Boston, Massachusetts, has an enrollment of over 20,000 degree candidates, including undergraduate, graduate, and professional students. Harvard has more than 360,000 alumni around the world. The University has twelve degree-granting Schools in addition to the Radcliffe Institute for Advanced Study, offering a truly global education. Established in 1636, Harvard is the oldest institution of higher education in the United States.