Difference between revisions of "Expand archive coverage to Debian-based distros (internship)"

From Software Heritage Wiki
Jump to: navigation, search
(Created page with "== "Araignée" pour distributions basées sur Debian == '''Contexte''': [https://www.softwareheritage.org/ Software Heritage], projet de recherche de grande envergure ayant c...")
 
 
(12 intermediate revisions by 2 users not shown)
Line 1: Line 1:
== "Araignée" pour distributions basées sur Debian ==
+
== Sauvegarder (toutes) les distros basées sur Debian dans Software Heritage ==
 +
 
 +
(english description follows)
  
 
'''Contexte''': [https://www.softwareheritage.org/ Software Heritage], projet
 
'''Contexte''': [https://www.softwareheritage.org/ Software Heritage], projet
Line 6: Line 8:
 
accessible en format code source.
 
accessible en format code source.
  
'''Description''':
+
'''Description''': L'archive logiciel de Software Heritage contient actuellement
 
+
une copie complète et à jour de GitHub, mais seulement une selection ''ad hoc''
'''TODO'''
+
(même si assez large) des paquets logiciels de la distribution Debian. Le but de
 +
ce stage est de automatiser le processus de recuperation et d'injection des
 +
paquets Debian en format source (.dsc) dans l'archive de Software Heritage.
 +
L'objectif est de rendre trivial l'ajout de n'importe quelle distribution de
 +
Logiciel Libre [https://en.wikipedia.org/wiki/List_of_Linux_distributions#Debian-based basée sur Debian].
  
 
'''Connaissances souhaitées''' pour accéder au stage:
 
'''Connaissances souhaitées''' pour accéder au stage:
 
* connaissance de [https://www.debian.org/ Debian] ou d'une distribution basée sur Debian
 
* connaissance de [https://www.debian.org/ Debian] ou d'une distribution basée sur Debian
* Python
+
* environnement Linux
 +
* la familiarité avec Python et PostgreSQL pourra être un plus
  
 
'''Établissement d'accueil''': Inria Paris
 
'''Établissement d'accueil''': Inria Paris
  
 
'''Encadrants''':
 
'''Encadrants''':
 +
* Roberto Di Cosmo <roberto@dicosmo.org>
 +
* Stefano Zacchiroli <zack@upsilon.cc>
 +
 +
== Expand archive coverage to Debian-based distros ==
 +
 +
'''Context''': [https://www.softwareheritage.org/ Software Heritage] is an
 +
ambitious research project whose goal is to collect, preserve in the very long
 +
term, and share the whole publicly accessible Free/Open Source Software
 +
(FOSS) in source code form.
 +
 +
'''Description''': The Software Heritage archive currently contains a full,
 +
up-to-date mirror of GitHub, as well as an ''ad hoc'' selection (and a very
 +
big one) of software packages coming from the Debian distribution. The goal of
 +
this internship is to fully automate the process of collection and ingestion
 +
of Debian source packages (.dsc) into the Software Heritage archive. The main
 +
objective is to make it trivial the addition of anyone of the many FOSS
 +
distributions that are
 +
[https://en.wikipedia.org/wiki/List_of_Linux_distributions#Debian-based based on Debian].
 +
 +
'''Desirable skills''' to obtain this internship:
 +
* familiarity with [https://www.debian.org/ Debian] or other Debian-based distributions
 +
* GNU/Linux environment
 +
* Python
 +
* working knowledge of PostgreSQL would be a plus
 +
 +
'''Workplace''': Inria Paris
 +
 +
'''Environnement''': you will work shoulder to shoulder with all members of the
 +
Software Heritage team, and you will have a chance to witness from within the
 +
construction of the ultimate source code archive.
 +
 +
'''Internship mentors''':
 
* Roberto Di Cosmo <roberto@dicosmo.org>
 
* Roberto Di Cosmo <roberto@dicosmo.org>
 
* Stefano Zacchiroli <zack@upsilon.cc>
 
* Stefano Zacchiroli <zack@upsilon.cc>
  
  
[[Category:Available internship]]
+
[[Category:Completed internship]]
 
[[Category:Internship]]
 
[[Category:Internship]]
 
[[Category:Lang:French]]
 
[[Category:Lang:French]]
 +
[[Category:Lang:English]]

Latest revision as of 11:20, 20 January 2018

Sauvegarder (toutes) les distros basées sur Debian dans Software Heritage

(english description follows)

Contexte: Software Heritage, projet de recherche de grande envergure ayant comme but la récupération, l'archivage à très long terme, et le partage de la totalité du Logiciel Libre publiquement accessible en format code source.

Description: L'archive logiciel de Software Heritage contient actuellement une copie complète et à jour de GitHub, mais seulement une selection ad hoc (même si assez large) des paquets logiciels de la distribution Debian. Le but de ce stage est de automatiser le processus de recuperation et d'injection des paquets Debian en format source (.dsc) dans l'archive de Software Heritage. L'objectif est de rendre trivial l'ajout de n'importe quelle distribution de Logiciel Libre basée sur Debian.

Connaissances souhaitées pour accéder au stage:

  • connaissance de Debian ou d'une distribution basée sur Debian
  • environnement Linux
  • la familiarité avec Python et PostgreSQL pourra être un plus

Établissement d'accueil: Inria Paris

Encadrants:

  • Roberto Di Cosmo <roberto@dicosmo.org>
  • Stefano Zacchiroli <zack@upsilon.cc>

Expand archive coverage to Debian-based distros

Context: Software Heritage is an ambitious research project whose goal is to collect, preserve in the very long term, and share the whole publicly accessible Free/Open Source Software (FOSS) in source code form.

Description: The Software Heritage archive currently contains a full, up-to-date mirror of GitHub, as well as an ad hoc selection (and a very big one) of software packages coming from the Debian distribution. The goal of this internship is to fully automate the process of collection and ingestion of Debian source packages (.dsc) into the Software Heritage archive. The main objective is to make it trivial the addition of anyone of the many FOSS distributions that are based on Debian.

Desirable skills to obtain this internship:

  • familiarity with Debian or other Debian-based distributions
  • GNU/Linux environment
  • Python
  • working knowledge of PostgreSQL would be a plus

Workplace: Inria Paris

Environnement: you will work shoulder to shoulder with all members of the Software Heritage team, and you will have a chance to witness from within the construction of the ultimate source code archive.

Internship mentors:

  • Roberto Di Cosmo <roberto@dicosmo.org>
  • Stefano Zacchiroli <zack@upsilon.cc>