Ingest Wikidata software origins (internship)

From Software Heritage Wiki

Latest revision as of 15:09, 4 February 2024

Context: Software Heritage is an ambitious initiative whose goal is to collect, preserve forever, and make publicly available the entire body of software, in the preferred form for making modifications to it.

Description: The Software Heritage archive currently contains source code coming mostly from major development forges and distributions. Wikidata is a free and open knowledge base about everything, including software development projects. The goal of this internship is to list the software origins described in Wikidata (in particular, but not only, version control systems) and make sure they are periodically crawled and ingested into the Software Heritage archive.
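As a rough illustration of the listing step, the sketch below queries the public Wikidata SPARQL endpoint for items that carry a source code repository URL (property P1324) and turns the answer into a de-duplicated list of candidate origins. It is only one possible starting point, not the project's design: the function names, the query limit, and the User-Agent string are assumptions made for this example, and how the resulting URLs would be scheduled for crawling by Software Heritage is left out.

```python
import json
from urllib.parse import urlencode
from urllib.request import Request, urlopen

# Public Wikidata SPARQL endpoint.
WIKIDATA_SPARQL_ENDPOINT = "https://query.wikidata.org/sparql"

# P1324 is Wikidata's "source code repository URL" property.
# The LIMIT is an arbitrary choice for this sketch.
QUERY = """
SELECT ?item ?repo WHERE {
  ?item wdt:P1324 ?repo .
}
LIMIT 100
"""


def extract_origins(sparql_json):
    """Return sorted, de-duplicated repository URLs from a SPARQL JSON result."""
    bindings = sparql_json.get("results", {}).get("bindings", [])
    urls = {b["repo"]["value"].strip() for b in bindings if "repo" in b}
    return sorted(u for u in urls if u)


def fetch_origins(query=QUERY, endpoint=WIKIDATA_SPARQL_ENDPOINT):
    """Run the query against the endpoint and return candidate origin URLs."""
    url = endpoint + "?" + urlencode({"query": query, "format": "json"})
    request = Request(
        url,
        headers={
            # Hypothetical identifier; a real crawler should use its own.
            "User-Agent": "swh-wikidata-sketch/0.1 (example)",
            "Accept": "application/sparql-results+json",
        },
    )
    with urlopen(request, timeout=60) as response:
        return extract_origins(json.load(response))
```

Calling `fetch_origins()` requires network access; `extract_origins` can be exercised offline on any dictionary that follows the SPARQL JSON results layout.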

Desirable skills to obtain this internship:

  • familiarity with version control systems
  • familiarity with Wikipedia and/or Wikidata
  • Python development

Workplace: on site at Inria Paris (contact mentors for remote opportunities)

Environment: you will work shoulder to shoulder with all members of the Software Heritage team, and you will have a chance to witness from within the construction of the great library of source code.

Internship mentors: TBD (ask on Matrix)

See also