CVS loader (internship)
Context: Software Heritage is an ambitious research project whose goal is to collect, preserve in the very long term, and share all publicly accessible Free/Open Source Software (FOSS) in source code form.
Description: The Software Heritage archive currently contains source code from publicly available repositories of popular Version Control Systems (VCS), such as Git, Subversion, and Mercurial. We want to extend the archive's coverage to source code held in historically relevant VCSs, in particular CVS (Concurrent Versions System). The goal of this internship is to develop an automated "loader" that ingests source code from CVS repositories into the archive.
Desirable skills to obtain this internship:
- familiarity with version control systems (VCS)
- Python development
- working knowledge of PostgreSQL would be a plus
Workplace: Inria Paris
Environment: you will work shoulder to shoulder with all members of the Software Heritage team, and you will have a chance to witness from within the construction of the great library of source code.
Contact: Stefano Zacchiroli <email@example.com>