Difference between revisions of "Integrate Software Heritage and ClearlyDefined (internship)"

From Software Heritage Wiki
Jump to: navigation, search
(first draft, still incomplete)
 
 
(7 intermediate revisions by one other user not shown)
Line 2: Line 2:
 
|description=[https://clearlydefined.io/ ClearlyDefined] is a project whose goal is to collaboratively and semi-automatically curate information about Free/Open Source Software (FOSS) projects, including licensing and vulnerability information.
 
|description=[https://clearlydefined.io/ ClearlyDefined] is a project whose goal is to collaboratively and semi-automatically curate information about Free/Open Source Software (FOSS) projects, including licensing and vulnerability information.
 
As one of its main output, ClearyDefined maintains an open data knowledge-base that cross references FOSS source code artifacts found in version control systems, package repositories, etc. to curated information about their licenses and vulnerabilities. The same source code artifacts are archived by Software Heritage for long-term preservation purposes.
 
As one of its main output, ClearyDefined maintains an open data knowledge-base that cross references FOSS source code artifacts found in version control systems, package repositories, etc. to curated information about their licenses and vulnerabilities. The same source code artifacts are archived by Software Heritage for long-term preservation purposes.
The goal of this internship is to integrate ClearlyDefined and Software Heritage.
+
The goal of this internship is to integrate ClearlyDefined and Software Heritage, for mutual benefit.
 +
Software Heritage will benefit from mirroring ClearlyDefined data, allowing to query them while navigating the archive and at scale; ClearlyDefined will benefit from learning about the existing of FOSS projects that have not been analyzed for "clarity" yet.
  
 
|skills=
 
|skills=
* Python development
+
* JavaScript / NodeJS
 +
* Python
 +
* experience with database management systems (of any kind)
  
 
|mentors=
 
|mentors=
* Philippe Ombredanne <pombredanne@nexb.com> (for ClearlyDefined)
+
* Philippe Ombredanne <pombredanne@nexb.com> (nexB)
* Stefano Zacchiroli <zack@upsilon.cc> (for Software Heritage)
+
* Valentin Lorentz
 +
* Stefano Zacchiroli <zack@upsilon.cc> (Software Heritage)
 +
 
 +
|environment=
 +
you will work shoulder to shoulder with members of the [https://www.softwareheritage.org/people/ Software Heritage] and [https://clearlydefined.io/about ClearlyDefined project], with mentors from both projects
 
}}
 
}}
  
[[Category:Available internship]]
+
== Clearlydefined ==
[[Category:Internship]]
+
 
[[Category:Lang:English]]
+
* [[ClearlyDefinedObject]]
 +
 
 +
[[Category:Ongoing internship]]

Latest revision as of 15:24, 1 January 2021

Context: Software Heritage is an ambitious research project whose goal is to collect, preserve in the very long term, and share the whole publicly accessible Free/Open Source Software (FOSS) in source code form.

Description: ClearlyDefined is a project whose goal is to collaboratively and semi-automatically curate information about Free/Open Source Software (FOSS) projects, including licensing and vulnerability information. As one of its main output, ClearyDefined maintains an open data knowledge-base that cross references FOSS source code artifacts found in version control systems, package repositories, etc. to curated information about their licenses and vulnerabilities. The same source code artifacts are archived by Software Heritage for long-term preservation purposes. The goal of this internship is to integrate ClearlyDefined and Software Heritage, for mutual benefit. Software Heritage will benefit from mirroring ClearlyDefined data, allowing to query them while navigating the archive and at scale; ClearlyDefined will benefit from learning about the existing of FOSS projects that have not been analyzed for "clarity" yet.

Desirable skills to obtain this internship:

  • JavaScript / NodeJS
  • Python
  • experience with database management systems (of any kind)

Workplace: on site at Inria Paris (contact mentors for remote opportunities)

Environment: you will work shoulder to shoulder with members of the Software Heritage and ClearlyDefined project, with mentors from both projects

Internship mentors:

  • Philippe Ombredanne <pombredanne@nexb.com> (nexB)
  • Valentin Lorentz
  • Stefano Zacchiroli <zack@upsilon.cc> (Software Heritage)

See also


Clearlydefined