Difference between revisions of "Integrate Software Heritage and ClearlyDefined (internship)"

From Software Heritage Wiki
Jump to: navigation, search
 
(5 intermediate revisions by one other user not shown)
Line 11: Line 11:
  
 
|mentors=
 
|mentors=
* Jeff Mendoza <jemendoz@microsoft.com> (ClearlyDefined)
+
* Philippe Ombredanne <pombredanne@nexb.com> (nexB)
* Philippe Ombredanne <pombredanne@nexb.com> (ClearlyDefined)
+
* Valentin Lorentz
 
* Stefano Zacchiroli <zack@upsilon.cc> (Software Heritage)
 
* Stefano Zacchiroli <zack@upsilon.cc> (Software Heritage)
 
|workplace=
 
remote; on site at work at [https://www.inria.fr/en/centre-inria-de-paris Inria Paris] possible for local students
 
  
 
|environment=
 
|environment=
Line 22: Line 19:
 
}}
 
}}
  
[[Category:Available internship]]
+
== Clearlydefined ==
[[Category:Internship]]
+
 
[[Category:Lang:English]]
+
* [[ClearlyDefinedObject]]
 +
 
 +
[[Category:Ongoing internship]]

Latest revision as of 15:24, 1 January 2021

Context: Software Heritage is an ambitious research project whose goal is to collect, preserve in the very long term, and share the whole publicly accessible Free/Open Source Software (FOSS) in source code form.

Description: ClearlyDefined is a project whose goal is to collaboratively and semi-automatically curate information about Free/Open Source Software (FOSS) projects, including licensing and vulnerability information. As one of its main output, ClearyDefined maintains an open data knowledge-base that cross references FOSS source code artifacts found in version control systems, package repositories, etc. to curated information about their licenses and vulnerabilities. The same source code artifacts are archived by Software Heritage for long-term preservation purposes. The goal of this internship is to integrate ClearlyDefined and Software Heritage, for mutual benefit. Software Heritage will benefit from mirroring ClearlyDefined data, allowing to query them while navigating the archive and at scale; ClearlyDefined will benefit from learning about the existing of FOSS projects that have not been analyzed for "clarity" yet.

Desirable skills to obtain this internship:

  • JavaScript / NodeJS
  • Python
  • experience with database management systems (of any kind)

Workplace: on site at Inria Paris (contact mentors for remote opportunities)

Environment: you will work shoulder to shoulder with members of the Software Heritage and ClearlyDefined project, with mentors from both projects

Internship mentors:

  • Philippe Ombredanne <pombredanne@nexb.com> (nexB)
  • Valentin Lorentz
  • Stefano Zacchiroli <zack@upsilon.cc> (Software Heritage)

See also


Clearlydefined