Data Management

Capturing Scholarship: Electronic Theses and Dissertations in the DRS

Northeastern’s electronic theses and dissertations (ETDs) provide a valuable record of the university’s scholarly contributions, capturing the evolution of research across numerous academic disciplines over the past two decades. The Digital Repository Service (DRS) preserves all ETDs from 2008 onward, along with selected earlier works, creating a collection of more than 7,500 items spanning over 30 departments and nearly 70 academic programs.

As some of the DRS’ most frequently accessed materials, ETDs offer rich insights into the university’s academic history and digital presence. To celebrate the 10th anniversary of the DRS, Digital Production Services (DPS) — the department responsible for managing both the DRS and ETDs — set out to share insights into how theses and dissertations are added to the repository and how Northeastern’s ETD collections have evolved over time.

ETD Creation to DRS Ingest: Process Overview

Graduate students first submit their ETDs to ProQuest as a condition of graduation. Each program sets its own rules for the submission package and document organization. Once the student completes the submission and fills out information about their ETD, the file and metadata are sent as a zip file to a library server. Over the last 5+ years, a local workflow has been developed to:

  1. Export the files and move backups to other networked drives
  2. Record submissions in a spreadsheet to ensure file provenance
  3. Document any additional information, such as embargo dates or original file names, in case there are issues with the submission
  4. Review, normalize, and transform the existing ProQuest metadata to create DRS-compliant records for each file
  5. Add degree, school, and department information to each record to support the DRS collection structure
  6. Ingest the ETDs into their corresponding collections in the DRS
  7. Generate digital object identifiers (DOIs) for each ETD
  8. Conduct name authority control on all advisor and committee member names
Screenshot of the DRS with the heading Theses and Dissertations, with several drop-down menus
Filtering options for ETDs in the DRS.

New ETDs are processed and ingested every 2-3 months, depending on the time of year and the volume of ETD submissions, and can involve anywhere from 30 to 100 ETDs at a time. DOIs are generated and ETD contributor names are reviewed bi-annually.
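The first steps of the workflow above (unpacking a submission package and recording its files for provenance) can be sketched in Python. This is a minimal illustration, not the library's actual tooling: the function name `record_submission`, the CSV column names, and the use of a SHA-256 checksum are all assumptions made for the example.

```python
import csv
import hashlib
import zipfile
from datetime import date
from pathlib import Path

# Hypothetical log columns; the real tracking spreadsheet may differ.
LOG_FIELDS = ["package", "original_file_name", "sha256", "recorded_on"]


def record_submission(zip_path, extract_root, log_path):
    """Unpack one submission package and log each file for provenance."""
    zip_path = Path(zip_path)
    target = Path(extract_root) / zip_path.stem
    target.mkdir(parents=True, exist_ok=True)

    # Extract the package into its own folder, named after the zip file.
    with zipfile.ZipFile(zip_path) as package:
        package.extractall(target)

    # Build one log row per extracted file, with a checksum for later checks.
    rows = []
    for path in sorted(p for p in target.rglob("*") if p.is_file()):
        rows.append({
            "package": zip_path.name,
            "original_file_name": path.relative_to(target).as_posix(),
            "sha256": hashlib.sha256(path.read_bytes()).hexdigest(),
            "recorded_on": date.today().isoformat(),
        })

    # Append to the log, writing the header only when the log is new.
    log_path = Path(log_path)
    write_header = not log_path.exists()
    with log_path.open("a", newline="", encoding="utf-8") as log:
        writer = csv.DictWriter(log, fieldnames=LOG_FIELDS)
        if write_header:
            writer.writeheader()
        writer.writerows(rows)
    return rows
```

Calling `record_submission("etd_0001.zip", "extracted", "submissions.csv")` (hypothetical paths) would unpack the package and append one row per file to the log. Keeping the original file name and a checksum in the log makes it easier to trace a file back to its submission if a problem surfaces later.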

General Growth

The total number of ETDs submitted by Northeastern students has increased significantly since 2008. From 2008-2010, there was an average of around 190 documents submitted annually. As the 2010s continued, that number steadily increased from 353 in 2013 to 583 in 2019. There was a small dip in 2020, possibly due to COVID interrupting degree completions, but since then, there have been approximately 540-590 ETDs submitted each year.

Degree Distribution

Almost 90% of ETDs produced from 2008-2010 were either for Ph.D. or MS degrees, but as the School of Education started producing theses for the Ed.D. degree, those quickly became common, and represented 34% of all ETDs produced by 2020. Additional degree programs also started producing ETDs from 2010-2020, with MA, DLP, and MFA degrees representing almost 5% of ETDs during that period. In the last 4-5 years, numbers have stabilized, with Ph.D. dissertations regularly accounting for around 45% of all ETDs, Ed.D. theses around 35%, MS theses hovering around 15%, and all other degree types filling out the remaining 5%.
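Percentages like these can be derived from a metadata export with a short calculation. The Python sketch below assumes each record is a dictionary with hypothetical `year` and `degree` keys; the actual column names in the DRS export are not given here, and the sample records are made up for illustration.

```python
from collections import Counter


def degree_shares(records, start_year, end_year):
    """Return each degree's percentage of ETDs in an inclusive year range."""
    counts = Counter(
        r["degree"] for r in records if start_year <= r["year"] <= end_year
    )
    total = sum(counts.values())
    if total == 0:
        return {}
    return {degree: round(100 * n / total, 1) for degree, n in counts.items()}


# Made-up sample records, not real DRS data.
sample = [
    {"year": 2009, "degree": "Ph.D."},
    {"year": 2009, "degree": "MS"},
    {"year": 2010, "degree": "Ph.D."},
    {"year": 2010, "degree": "Ed.D."},
    {"year": 2015, "degree": "Ed.D."},
]
shares = degree_shares(sample, 2008, 2010)
```

Here `shares` covers only the four sample records from 2008 to 2010, so the 2015 record does not affect the result.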

Line graph titled "ETD Submissions by Degree Type (2008-2024)"
Data visualization showing ETD submissions by degree type from 2008-2024. Created by Claude (Anthropic) based on analysis of a dataset exported from the DRS and transformed by the author. Generated May 2025.

College, School, Department, and Program Representation

In the early years, most ETDs produced by Northeastern students came from the College of Engineering (COE), which accounted for almost 62% of submissions from 2008-2010. Throughout the 2010s, other colleges emerged as significant contributors, including the Bouvé College of Health Sciences, the College of Professional Studies (CPS), and the College of Science (COS).

Line graph titled "ETD Submissions by College (2008-2024)"
Data visualization showing ETD submissions by college from 2008-2024. Created by Claude (Anthropic) based on analysis of a dataset exported from the DRS and transformed by the author. Generated May 2025.

Within the College of Engineering, the Electrical and Computer Engineering and Mechanical and Industrial Engineering departments remain the most prolific ETD producers. Outside COE, the leading producers include the School of Education, the Department of Chemistry and Chemical Biology, and the Department of Art + Design.

The top 10 departments by total submission count:

  1. School of Education (2,143 submissions)
  2. Department of Electrical and Computer Engineering (910)
  3. Department of Mechanical and Industrial Engineering (705)
  4. Department of Chemistry and Chemical Biology (316)
  5. Department of Art + Design (271)
  6. Computer Science Program (245)
  7. Department of Civil and Environmental Engineering (242)
  8. School of Pharmacy (212)
  9. Department of Chemical Engineering (209)
  10. Department of Counseling and Applied Educational Psychology (202)

Addition of Supplementary Files

The first ETD to include supplemental files (files submitted to accompany the ETD PDF) appeared in 2013. The number of supplemental files grew throughout the 2010s, with supplemental material representing 4% of all ETD file submissions during that time. Since 2020, the number of supplemental files has declined slightly, but submissions remain regular, with 26 provided in 2024. The college that most often submits these files is the College of Arts, Media, and Design (CAMD), with almost 1 in 4 theses including supplemental materials.

Other notable contributors include COE and the College of Social Sciences and Humanities (CSSH). The smallest contributor is CPS, which, despite being the largest contributor of ETDs overall, has only 11 total supplemental files since 2013.

Screenshot of an item in the DRS titled "Supplemental file for 'Horn of plenty.'" A photo of a decorative green plant is on the left and metadata is listed on the right
Screenshot of a supplementary file page that features a photograph stored in the DRS. Original photo by Hannah M. Groudas.

New Undergraduate Theses

More recently, undergraduate programs from departments like Biology, Biochemistry, Marine and Environmental Science, and Psychology have begun to submit electronic theses directly to DPS staff. DPS offers the same level of service to the undergraduate theses as the graduate ETDs and includes the same metadata in each accompanying description to ensure these materials are as discoverable as the graduate theses and dissertations.

Maintaining ETDs is a vital part of the DRS’ mission, presenting unique challenges that library staff are well-equipped to manage. As the submission processes, file formats, academic disciplines, and research topics continue to evolve, the library remains committed to preserving and providing access to these scholarly works. Through ongoing innovation and stewardship, we ensure that the academic contributions and history of Northeastern students are securely archived and shared for generations to come.

AI acknowledgement: Claude Projects was used to generate data visualizations based on ETD metadata exported from the DRS and transformed into a spreadsheet dataset. Specific visualizations based on identified columns were requested. Project instructions, prompts, and dataset are available here.

Research Support Newsletter – Fall 2025

This post was originally sent as a newsletter for Research Support Staff at Northeastern University on September 3, 2025. If you would like to subscribe to receive future newsletters, please click here.

Did you know the library can help with…your grant proposal?

Join us for our Accelerate Your Proposal Development event! This program is a countdown of proposal-related questions the library can help with, including personalized support for crafting data management and sharing plans, improving your data visualizations and graphics, strategies for efficient literature reviews, and citation management. We’ll share information about the tools and people who can help you develop key proposal components and supplementary materials. Whether you’re in the early stages of developing your proposal or fine-tuning it before submission, we’re happy to work with you.

This virtual event takes place Wednesday, October 29, from noon – 1 p.m. Eastern time. Register here.

Did you know we have access to…tools and services to complete evidence syntheses?

This month, we are highlighting two ways the library can support your evidence synthesis project. Evidence synthesis projects, which often do not require funding, can reveal important research gaps, thus strengthening future grant applications. If you are working on (or considering working on) a systematic review, scoping review, rapid review, or meta-analysis, read on!

Evidence Synthesis Service: Northeastern University Library provides a tiered set of support services for evidence synthesis projects such as systematic reviews, ranging from expert librarian guidance to full research partnerships. See our website and service tiers for more information.

Covidence: Covidence is a web-based evidence synthesis support tool that assists in screening references, data extraction, and keeping track of your work. Covidence requires registration with a Northeastern email address. If you already have an account, please sign in.

Start Smart — Foundations of Evidence Syntheses: Starting September 15, the library will be running a virtual workshop series for faculty and research staff on planning for and embarking on an evidence synthesis project.

Have any questions about completing evidence syntheses? Reach out to our expert, Philip Espinola Coombs.

We want to hear from you!

Research Data Storage Finder: We’re developing an interactive online tool to help researchers quickly narrow down the best platform for their data storage and archiving needs, and we’d love to hear what you think of what we’ve built so far. If you’d like to get a sneak peek and share your feedback, please let us know via this form.

That’s it!

Questions about the library? Email Alissa Link Cilfone, Head of STEM, or Jen Ferguson, Head of Research Data Services — we’d love to hear from you!

What is the DRS and who is it for?

What is the DRS?

The Digital Repository Service (DRS) is an institutional repository that was designed by the Northeastern University Library to help members of the Northeastern community organize, store, and share the digital materials that are important to their role or responsibilities at the university. This can include scholarly works created by faculty and students; supporting materials used in research; photographs and documents that represent the history of the community; or materials that support the day-to-day operations of the university.

While the DRS itself is a technical system that stores digital files and associated information to help users find what they need, we also consider the DRS to be a service for the university community: library staff are here to help you organize, store, share, and manage the digital materials that have long-lasting value for the university community and beyond.

Result listing in the DRS for a report titled "Exploring the Effectiveness of Bite-Sized Learning for Statistics via TikTok" and includes metadata and an image of the report
Published research from the Northeastern community available in the DRS.

Northeastern is not alone in this endeavor. Repository services are now standard practice for most academic institutions, including Harvard University Library (which also uses the name “Digital Repository Service”), Stanford University Library (a leader in technical development for repository systems), Tufts Libraries, and other institutions around the world.

Who uses the DRS?

The DRS has been used by faculty, staff, students, and researchers from all corners of the university community for 10 years. There are too many use cases to mention in one brief blog post, but here are some trends we’ve seen in what users have chosen to deposit over the last few years.

  • Open access copies of research publications, as well as working papers and technical reports
  • Publications and data that support published research
  • Event recordings, photographs, newspapers, and almost any kind of material you can think of to support the day-to-day operations and activity at the university
  • Student research projects and classwork, such as oral histories. Students are also required to contribute the final version of their thesis or dissertation.
  • Digitized and born-digital records from the Archives and Special Collections, including photographs, documents, and audio and video recordings

These files, and all the other audio, video, document, and photograph files in the DRS, have been viewed or downloaded 11.2 million times since the DRS first launched in 2015. Nearly half of the files in the DRS are made available to the public and are therefore available for the wider world to discover. Materials in the DRS have been cited in reporting by CNN, Pitchfork, WBUR, and Atlas Obscura, among others, and are regularly shared on social media or in Reddit threads. As a result, Northeastern continues to contribute the work produced here to the larger scholarly and cultural record, and to the larger world.

Who supports the DRS?

The day-to-day work managing, maintaining, and supporting users of the service comes from staff in Digital Production Services:

  • Kim Kennedy supervises the digitization of physical materials and processing of born-digital and digitized materials.
  • Drew Facklam and Emily Allen create and maintain the descriptive metadata that helps you find what you need.
  • And all of us in the department, including part-time staff, are responsible for general management of the system, including batch ingesting materials, holding consultations and training sessions, answering questions, and leading conversations about how to improve the system and the service.
Two people stand in front of a presentation with a screenshot of the DRS behind them
Sarah Sweeney and David Cliff, DRS staff, posing in 2015 with the homepage of the recently launched DRS. 

The DRS is also supported by a number of library staff members across the library:

  • David Cliff, Senior Digital Library Developer in Digital Infrastructures, is the DRS’ lead developer and system administrator.
  • Ernesto Valencia and Rob Chavez from the Library Technology Services and Infrastructure departments also provide development support and system administration.
  • Many librarians in the Research and Instruction department do outreach about the service and support faculty as they figure out how to use it in their work.
  • Jen Ferguson from Research Data Services also connects faculty and researchers to the DRS, while also providing data management support for those wishing to use the DRS to store their data.
  • Members of the library administration, including Dan Cohen, Evan Simpson, Tracey Harik, and the recently retired Patrick Yott, have contributed their unwavering support and advocacy for developing and maintaining the system and service.

We are all here to help you figure out how the DRS may be used to make your work and academic life easier. To dive deeper into what the DRS is and how to use it, visit the DRS subject guide or contact me or my team.

The library is celebrating 10 years of the DRS! Check out A Decade of the Digital Repository Service to read more about the history of the DRS.

A Decade of the Digital Repository Service

Northeastern University Library’s institutional repository, the Digital Repository Service, is celebrating 10 years of caring for the university’s high-value scholarly, archival, and administrative materials. From day one, the mission of the DRS has been to provide a long-term, sustainable home for the born-digital and digitized content being produced by members of the Northeastern community.

More than just a technical system, the DRS is a service provided by the library to help solve a common problem for faculty, staff, students, researchers, and project teams: where can I store the digital output from my work? The DRS allows these projects developed at Northeastern to be maintained and shared with a wider audience. In addition to maintaining the DRS system, services provided by DRS staff include running training sessions, answering questions, consulting, and depositing files for users.

Originally developed as a prototype in 2011, the system was created by a library team — three developers, the repository manager, a Northeastern co-op, and a library administrator — with the goal of constructing a completely realized system ready for production. The first version was ready to be used fully by the Northeastern community in June 2015.

The DRS was launched with some rough edges, which were slowly smoothed into the system users are familiar with today. We have received tremendous response from users about the usefulness of the system, as well as thoughtful and constructive feedback about how the system can be improved (e.g. faster page load times, better search functionality, and more control over files, among others).

The DRS homepage displayed on a laptop screen with a hand typing on the computer's keyboard
The DRS, as it appeared in 2015.

We have done our best to grow with the university community as its needs shift by increasing support for datasets, loading large batches of files on behalf of users and project teams, and tripling our original storage capacity, but there is always more to be done to meet the needs of our users.

The shape of the content stored in the DRS has shifted over the years, as well. Initially just for theses and dissertations, university photographs, and archival material, the DRS now fully supports various types of project materials for digital humanities research, datasets for researchers in various disciplines, oral histories, and many others.

Since its launch, DRS content has been viewed, downloaded, or streamed more than 11.2 million times, and we’ve had more than 13,000 members of the Northeastern community sign into the system. The DRS averages approximately 2,000 unique visitors and 4,000 views, downloads, and streams a day.

Screenshot of a DRS display of a research poster titled "Investigating and addressing the needs of research support staff"
The DRS provides a home for and access to research and projects by members of the Northeastern community.

The success of the system can be attributed to the combined efforts of staff in many library departments, including development and system administration from Library Technology Services and Digital Infrastructures; outreach and faculty support from Research and Instruction; data management support from Research Data Services; issue triage and metadata collaboration with Resource and Discovery Services; and continual support and advocacy from library administration. And, of course, Digital Production Services, the department primarily responsible for maintaining the system and supporting the service through digital production, metadata maintenance, and user support.

The DRS is not the first system of its kind supported by the library. The library adopted its first repository system in the early 2000s, followed by IRis in 2007. The library’s commitment to maintaining the scholarly output of the university was formed during those early years, a commitment we have refined and strengthened over more than 20 years of dedicated support for faculty, staff, and students working to help fulfill the university’s mission. It’s been a great pleasure to support the Northeastern community in this way, and we look forward to the next 10 years and beyond.