Difference between revisions of "Darcs loader (internship)"

From Software Heritage Wiki
 
(3 intermediate revisions by the same user not shown)
Line 1: Line 1:
− == Loader for Mercurial and/or Darcs repositories ==

− (English description follows)
 
 
− '''Context''': [https://www.softwareheritage.org/ Software Heritage], a large-scale research project whose goal is to collect, preserve in the very long term, and share all publicly accessible Free Software in source code form.
 
 
− '''Description''': The Software Heritage archive currently contains source code coming mainly from Git repositories accessible over the Internet. We want to support many other version control systems, giving priority to other popular DVCSs (Distributed Version Control Systems) such as [https://www.mercurial-scm.org/ Mercurial] and [http://darcs.net/ Darcs]. The goal of this internship is to develop the "loaders" that ingest into the Software Heritage archive source code available in Mercurial and/or Darcs repositories.
 
 
− '''Desirable skills''' for this internship:
− * knowledge of version control systems, in particular [https://www.mercurial-scm.org/ Mercurial] and/or [http://darcs.net/ Darcs]
− * data structures
− * programming
− * databases
− * familiarity with Python and PostgreSQL would be a plus
 
 
− '''Host institution''': Inria Paris

− '''Environment''': you will be fully immersed in the team building the Software Heritage archive, and you will be able to observe up close the construction of a project of worldwide scope.

− '''Supervisors''':
− * Roberto Di Cosmo <roberto@dicosmo.org>
− * Stefano Zacchiroli <zack@upsilon.cc>
 
 
 
− == Mercurial and Darcs loaders ==
 
 
 
'''Context''': [https://www.softwareheritage.org/ Software Heritage] is an
ambitious research project whose goal is to collect, preserve in the very long
Line 41: Line 5:
  
 
  '''Description''': The Software Heritage archive currently contains source code
− coming mostly from Git repositories publicly available on the Internet. We would
− like to extend the archive coverage to source code available from other popular
− Distributed Version Control Systems (DVCs), and in particular
− [https://www.mercurial-scm.org/ Mercurial] and [http://darcs.net/ Darcs].
− The goal of this internship is to develop automated "loaders" that can be used
− to ingest into the archive source code available from Mercurial and/or Darcs
− repositories.
+ coming from popular Version Control System (VCS) repositories publicly available
+ on the Internet, such as Git, Subversion, and Mercurial.
+ We want to extend the archive coverage to source code available from historically
+ relevant VCS, and in particular [http://darcs.net/ Darcs].
+ The goal of this internship is to develop an automated "loader" that can be used
+ to ingest into the archive source code available from Darcs repositories.
 
  
 
  '''Desirable skills''' to obtain this internship:
− * familiarity with the Distributed Version Control Systems (DVCs) [https://www.mercurial-scm.org/ Mercurial] and/or [http://darcs.net/ Darcs]
+ * familiarity with [https://en.wikipedia.org/wiki/Distributed_version_control Distributed Version Control Systems]
− * graph data structures and algorithms
  * Python development
− * RDBMs
  * working knowledge of PostgreSQL would be a plus
  
− '''Workplace''': Inria Paris
+ '''Workplace''': [https://www.inria.fr/en/centre-inria-de-paris Inria Paris]
  
− '''Environnement''': you will work shoulder to shoulder with all members of the
− Software Heritage team, and you will have a chance to witness from within the
− construction of the ultimate source code archive.
+ '''Environment''': you will work shoulder to shoulder with all members of the
+ [https://www.softwareheritage.org/people/ Software Heritage team], and you will
+ have a chance to witness from within the construction of the great library of
+ source code.
  
 
  '''Internship mentors''':
− * Roberto Di Cosmo <roberto@dicosmo.org>
  * Stefano Zacchiroli <zack@upsilon.cc>
  
  
− [[Category:Available internship]]
+ [[Category:Archived internship]]
  [[Category:Internship]]
− [[Category:Lang:French]]
  [[Category:Lang:English]]

Latest revision as of 08:35, 29 January 2020

Context: Software Heritage is an ambitious research project whose goal is to collect, preserve in the very long term, and share the whole publicly accessible Free/Open Source Software (FOSS) in source code form.

Description: The Software Heritage archive currently contains source code coming from popular Version Control System (VCS) repositories publicly available on the Internet, such as Git, Subversion, and Mercurial. We want to extend the archive coverage to source code available from historically relevant VCS, and in particular Darcs. The goal of this internship is to develop an automated "loader" that can be used to ingest into the archive source code available from Darcs repositories.

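Desirable skills to obtain this internship:

  • familiarity with Distributed Version Control Systems
  • Python development
  • working knowledge of PostgreSQL would be a plus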

Workplace: Inria Paris

Environment: you will work shoulder to shoulder with all members of the Software Heritage team, and you will have a chance to witness from within the construction of the great library of source code.

Internship mentors:

  • Stefano Zacchiroli <zack@upsilon.cc>