Integrate Software Heritage and ClearlyDefined (internship)

From Software Heritage Wiki
Revision as of 13:46, 2 February 2020 by StefanoZacchiroli (talk | contribs) (first draft, still incomplete)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Context: Software Heritage is an ambitious research project whose goal is to collect, preserve in the very long term, and share the whole publicly accessible Free/Open Source Software (FOSS) in source code form.

Description: ClearlyDefined is a project whose goal is to collaboratively and semi-automatically curate information about Free/Open Source Software (FOSS) projects, including licensing and vulnerability information. As one of its main output, ClearyDefined maintains an open data knowledge-base that cross references FOSS source code artifacts found in version control systems, package repositories, etc. to curated information about their licenses and vulnerabilities. The same source code artifacts are archived by Software Heritage for long-term preservation purposes. The goal of this internship is to integrate ClearlyDefined and Software Heritage.

Desirable skills to obtain this internship:

  • Python development

Workplace: on site at Inria Paris (contact mentors for remote opportunities)

Environment: you will work shoulder to shoulder with all members of the Software Heritage team, and you will have a chance to witness from within the construction of the great library of source code.

Internship mentors:

  • Philippe Ombredanne <> (for ClearlyDefined)
  • Stefano Zacchiroli <> (for Software Heritage)

See also