Code scanner (internship)

From Software Heritage Wiki
Revision as of 08:52, 22 October 2018 by StefanoZacchiroli (talk | contribs) (Created page with "== Compliance code scanner based on the Software Heritage archive == '''Context''': [https://www.softwareheritage.org/ Software Heritage] is an ambitious research project who...")
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to: navigation, search

Compliance code scanner based on the Software Heritage archive

Context: Software Heritage is an ambitious research project whose goal is to collect, preserve in the very long term, and share the whole publicly accessible Free/Open Source Software (FOSS) in source code form.

Description: Companies shipping software as part of their products review the source code they ship against databases of known FOSS components to make sure they are not shipping unexpected pieces of code. The goal of this internship is developing a source code scanner that will be run on a software project to determine which parts of it are already known/archived in the Software Heritage archive. The scanning should be as efficient as possible and the results should be displayed in simple graphical ways (e.g., treemaps).

Desirable skills to obtain this internship:

  • Python development

Workplace: Inria Paris

Environnement: you will work shoulder to shoulder with all members of the Software Heritage team, and you will have a chance to witness from within the construction of the ultimate source code archive.

Internship mentors:

  • Guillaume Rousseau <guillaume.rousseau@univ-paris-diderot.fr>
  • Stefano Zacchiroli <zack@upsilon.cc>