<?xml version="1.0"?>
<feed xmlns="http://www.w3.org/2005/Atom" xml:lang="en">
	<id>https://wiki.softwareheritage.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Archit+agrawal</id>
	<title>Software Heritage Wiki - User contributions [en]</title>
	<link rel="self" type="application/atom+xml" href="https://wiki.softwareheritage.org/api.php?action=feedcontributions&amp;feedformat=atom&amp;user=Archit+agrawal"/>
	<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/wiki/Special:Contributions/Archit_agrawal"/>
	<updated>2026-04-20T06:25:56Z</updated>
	<subtitle>User contributions</subtitle>
	<generator>MediaWiki 1.39.10</generator>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1122</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1122"/>
		<updated>2019-08-28T21:46:54Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: /* Student: */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Title:===&lt;br /&gt;
'''Increase archive coverage''' &lt;br /&gt;
&lt;br /&gt;
=== Description:===&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The goal of this project is to increase the archive coverage by making listers and loaders for different forges. &lt;br /&gt;
[https://docs.softwareheritage.org/devel/swh-lister/index.html#swh-lister Listers] are components that crawl the APIs of software forges (e.g., Bitbucket, GitHub, Sourceforge, ...) and return a list of the software available in it. Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model.&lt;br /&gt;
&lt;br /&gt;
===Student: === &lt;br /&gt;
Archit Agrawal&lt;br /&gt;
* [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* [https://wiki.softwareheritage.org/wiki/Google_Summer_of_Code_2019/Increase_archive_coverage/Commit_list List Of Commits]&lt;br /&gt;
&lt;br /&gt;
=== Mentors:===&lt;br /&gt;
* Nicolas Dandrimont&lt;br /&gt;
* Antoine R. Dumont&lt;br /&gt;
&lt;br /&gt;
===Work Done:===&lt;br /&gt;
* '''Listers:'''&lt;br /&gt;
** Completed and merged&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSfedfd73c8e4be8ce1d08b31c9a5cb99f9ca40fd6 Phabricator Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1482 GNU Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSa9a37a85bf9efac416cfdd152588bf01b7a063b2 CRAN Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1584 Packagist Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1610 CGit Lister]&lt;br /&gt;
** Did research on the methods that could be used to make following listers and made an implementation plan for the same&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1734 Launchpad Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1777 Rubygem Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1718 NuGET(.NET) Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1724 Maven Lister]&lt;br /&gt;
**  [https://forge.softwareheritage.org/rDLS08ade29e6de0616a3964360454ab52b58c082b75 Add tests to PyPI Lister]&lt;br /&gt;
** [https://forge.softwareheritage.org/rDLSf424f07c7e628eb7a19d25f4fdb749682d97a21f Refactor base tests for listers]&lt;br /&gt;
**  [https://forge.softwareheritage.org/D1441 Add documentation on *How to run a new lister*]&lt;br /&gt;
* '''Loaders:'''&lt;br /&gt;
** '''[https://forge.softwareheritage.org/T1389 Base Package Manager Loader]'''&lt;br /&gt;
*** Ingesting source code from package managers is a process somewhat similar for all of the package managers. This calls for a common base implementation for loading content from package managers into the archive. I worked on this idea, analysed the steps required to make a loader and the implementation of present package manager loader. Came up with the plan to implement the base loader and made the pass([https://forge.softwareheritage.org/D1694 D1694], [https://forge.softwareheritage.org/D1810 D1810], [https://forge.softwareheritage.org/D1811 D1811], [https://forge.softwareheritage.org/D1812 D1812], [https://forge.softwareheritage.org/D1813 D1813], [https://forge.softwareheritage.org/D1814 D1814], [https://forge.softwareheritage.org/D1744 D1744]). However, after the recommendation from my mentor, we changed the approach to make the base loader. Instead of making the whole base loader in one go, we decided to break it into multiple steps(3 steps) and follow the incremental approach.&lt;br /&gt;
**'''[https://forge.softwareheritage.org/D1824 GNU Loader]'''&lt;br /&gt;
*** As part of the first step towards the implementation of Base Loader, GNU Loader was implemented.&lt;br /&gt;
&lt;br /&gt;
===TO-DO:===&lt;br /&gt;
* Implement the Listers using the research done and the implementation plan made for Launchpad, Rubygem.&lt;br /&gt;
* Find the workarounds to solve the challenges in making the Maven and NuGET(.NET) Lister.&lt;br /&gt;
* Work on the remaining steps in order to complete the Base Package Manager Loader.&lt;br /&gt;
&lt;br /&gt;
=== Learnings: ===&lt;br /&gt;
Working in Software Heritage was a wholesome experience. I got to learn a new thing almost every day. Here is a few of the most prominent ones: &lt;br /&gt;
*Work on a huge codebase&lt;br /&gt;
*Plan and design before jumping to code&lt;br /&gt;
*Write clean and well-commented code&lt;br /&gt;
*Learn the difference between doing projects in college and in the industry(Spoiler Alert: '''A lot''')&lt;br /&gt;
*Multiple language integration in a python library (Used in CRAN Lister)&lt;br /&gt;
*Different programming methodologies explained to me by my mentors(eg [https://en.wikipedia.org/wiki/Test-driven_development TDD])&lt;br /&gt;
*Work with tools; DVCS (git), issue tracker (phabricator forge), containerization/virtualization (docker)&lt;br /&gt;
&lt;br /&gt;
=== Activity reports:===&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week (Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week  (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week (Coding)(Work Summary)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fifth Week  (First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Sixth Week   (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Seventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00011.html Week 29 Eight Week (Coding)(Work Summary)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00015.html Week 30 Nineth Week (Second Evaluation)]&lt;br /&gt;
* August 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00002.html Week 31 Tenth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00004.html Week 32 Eleventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 33 Twelfth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00011.html Week 34 Thirteenth Week (Final Evaluation)]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core/   Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category: Google Summer of Code]]&lt;br /&gt;
[[Category: Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage/Commit_list&amp;diff=1121</id>
		<title>Google Summer of Code 2019/Increase archive coverage/Commit list</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage/Commit_list&amp;diff=1121"/>
		<updated>2019-08-28T21:45:31Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: Created page with &amp;quot;Phabricator Lister: * [https://forge.softwareheritage.org/rDLSfedfd73c8e4be8ce1d08b31c9a5cb99f9ca40fd6 fedfd73c8e4be8ce1d08b31c9a5cb99f9ca40fd6]  GNU Lister: * [https://forge....&amp;quot;&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;Phabricator Lister:&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLSfedfd73c8e4be8ce1d08b31c9a5cb99f9ca40fd6 fedfd73c8e4be8ce1d08b31c9a5cb99f9ca40fd6]&lt;br /&gt;
&lt;br /&gt;
GNU Lister:&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLS151f6cd2235cedd420174ffad01fc8030ee4a787 151f6cd2235cedd420174ffad01fc8030ee4a787]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLSebdb959823bc004ff1766e97a409561dc48dcf3d ebdb959823bc004ff1766e97a409561dc48dcf3d]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLS709ba8a6e55c630f6ba7ba17595c11a2584bacda 709ba8a6e55c630f6ba7ba17595c11a2584bacda]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLS7c6245e663e074b95cc3422b10bd02a4d7b707db 7c6245e663e074b95cc3422b10bd02a4d7b707db]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLSf76b96b825f9e308d2d37b9e26bf1f13c1f916c8 f76b96b825f9e308d2d37b9e26bf1f13c1f916c8]&lt;br /&gt;
&lt;br /&gt;
CRAN Lister&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLSa9a37a85bf9efac416cfdd152588bf01b7a063b2 a9a37a85bf9efac416cfdd152588bf01b7a063b2]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLS5ea9d5ed392a12dc5558fe165b1cbf2c0dfbcbf0 5ea9d5ed392a12dc5558fe165b1cbf2c0dfbcbf0]&lt;br /&gt;
&lt;br /&gt;
Packagist Lister&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLS5727f15cf32345101bd38423b14ea199f6eb52e9 5727f15cf32345101bd38423b14ea199f6eb52e9]&lt;br /&gt;
&lt;br /&gt;
CGit Lister&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLSb972a2a88d25a5c63d6cbc39564c91f8062f9835 b972a2a88d25a5c63d6cbc39564c91f8062f9835]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLS7e3c79bb1d18e49ac446159066de33470528b809 7e3c79bb1d18e49ac446159066de33470528b809]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLS0bf24469b7e0ea2a3e5e9ea90cbfe690d1afb1d9 0bf24469b7e0ea2a3e5e9ea90cbfe690d1afb1d9]&lt;br /&gt;
&lt;br /&gt;
GNU Loader&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLDBASE054ef894d0ae0842ee01e573c59b90ac8c2994f7 054ef894d0ae0842ee01e573c59b90ac8c2994f7]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLDBASEcb253bdae866cab9b593e3de2d8701a3b4280010 cb253bdae866cab9b593e3de2d8701a3b4280010]&lt;br /&gt;
&lt;br /&gt;
Miscellaneous&lt;br /&gt;
* [https://forge.softwareheritage.org/rCDFD179ce3202a58abeb3c88c8166169045265c41e3f 179ce3202a58abeb3c88c8166169045265c41e3f]&lt;br /&gt;
* [https://forge.softwareheritage.org/rCDFDdee708ec4a79dc6a7a2db4788d7ea2d40c0a309d dee708ec4a79dc6a7a2db4788d7ea2d40c0a309d]&lt;br /&gt;
* [https://forge.softwareheritage.org/rCDFD560df3bc13c8853b94e646a00177ccd9c6784ada 560df3bc13c8853b94e646a00177ccd9c6784ada]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLSb2c6ddc35b6b126990099d923e91a365727aeccb b2c6ddc35b6b126990099d923e91a365727aeccb]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLS8d1b5d2d2d92d305a8aa019e119a7260adb78c55 8d1b5d2d2d92d305a8aa019e119a7260adb78c55]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLSf424f07c7e628eb7a19d25f4fdb749682d97a21f f424f07c7e628eb7a19d25f4fdb749682d97a21f]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLS08ade29e6de0616a3964360454ab52b58c082b75 08ade29e6de0616a3964360454ab52b58c082b75]&lt;br /&gt;
* [https://forge.softwareheritage.org/rDLSf8a2ae866bc2115e1c0589f1a97902447260e7fd f8a2ae866bc2115e1c0589f1a97902447260e7fd]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1109</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1109"/>
		<updated>2019-08-26T13:09:04Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: /* Description: */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Title:===&lt;br /&gt;
'''Increase archive coverage''' &lt;br /&gt;
&lt;br /&gt;
=== Description:===&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
The goal of this project is to increase the archive coverage by making listers and loaders for different forges. &lt;br /&gt;
[https://docs.softwareheritage.org/devel/swh-lister/index.html#swh-lister Listers] are components that crawl the APIs of software forges (e.g., Bitbucket, GitHub, Sourceforge, ...) and return a list of the software available in it. Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model.&lt;br /&gt;
&lt;br /&gt;
===Student: === &lt;br /&gt;
Archit Agrawal&lt;br /&gt;
* [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
&lt;br /&gt;
=== Mentors:===&lt;br /&gt;
* Nicolas Dandrimont&lt;br /&gt;
* Antoine R. Dumont&lt;br /&gt;
&lt;br /&gt;
===Work Done:===&lt;br /&gt;
* '''Listers:'''&lt;br /&gt;
** Completed and merged&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSfedfd73c8e4be8ce1d08b31c9a5cb99f9ca40fd6 Phabricator Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1482 GNU Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSa9a37a85bf9efac416cfdd152588bf01b7a063b2 CRAN Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1584 Packagist Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1610 CGit Lister]&lt;br /&gt;
** Did research on the methods that could be used to make following listers and made an implementation plan for the same&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1734 Launchpad Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1777 Rubygem Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1718 NuGET(.NET) Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1724 Maven Lister]&lt;br /&gt;
**  [https://forge.softwareheritage.org/rDLS08ade29e6de0616a3964360454ab52b58c082b75 Add tests to PyPI Lister]&lt;br /&gt;
** [https://forge.softwareheritage.org/rDLSf424f07c7e628eb7a19d25f4fdb749682d97a21f Refactor base tests for listers]&lt;br /&gt;
**  [https://forge.softwareheritage.org/D1441 Add documentation on *How to run a new lister*]&lt;br /&gt;
* '''Loaders:'''&lt;br /&gt;
** '''[https://forge.softwareheritage.org/T1389 Base Package Manager Loader]'''&lt;br /&gt;
*** Ingesting source code from package managers is a process somewhat similar for all of the package managers. This calls for a common base implementation for loading content from package managers into the archive. I worked on this idea, analysed the steps required to make a loader and the implementation of present package manager loader. Came up with the plan to implement the base loader and made the pass([https://forge.softwareheritage.org/D1694 D1694], [https://forge.softwareheritage.org/D1810 D1810], [https://forge.softwareheritage.org/D1811 D1811], [https://forge.softwareheritage.org/D1812 D1812], [https://forge.softwareheritage.org/D1813 D1813], [https://forge.softwareheritage.org/D1814 D1814], [https://forge.softwareheritage.org/D1744 D1744]). However, after the recommendation from my mentor, we changed the approach to make the base loader. Instead of making the whole base loader in one go, we decided to break it into multiple steps(3 steps) and follow the incremental approach.&lt;br /&gt;
**'''[https://forge.softwareheritage.org/D1824 GNU Loader]'''&lt;br /&gt;
*** As part of the first step towards the implementation of Base Loader, GNU Loader was implemented.&lt;br /&gt;
&lt;br /&gt;
===TO-DO:===&lt;br /&gt;
* Implement the Listers using the research done and the implementation plan made for Launchpad, Rubygem.&lt;br /&gt;
* Find the workarounds to solve the challenges in making the Maven and NuGET(.NET) Lister.&lt;br /&gt;
* Work on the remaining steps in order to complete the Base Package Manager Loader.&lt;br /&gt;
&lt;br /&gt;
=== Learnings: ===&lt;br /&gt;
Working in Software Heritage was a wholesome experience. I got to learn a new thing almost every day. Here is a few of the most prominent ones: &lt;br /&gt;
*Work on a huge codebase&lt;br /&gt;
*Plan and design before jumping to code&lt;br /&gt;
*Write clean and well-commented code&lt;br /&gt;
*Learn the difference between doing projects in college and in the industry(Spoiler Alert: '''A lot''')&lt;br /&gt;
*Multiple language integration in a python library (Used in CRAN Lister)&lt;br /&gt;
*Different programming methodologies explained to me by my mentors(eg [https://en.wikipedia.org/wiki/Test-driven_development TDD])&lt;br /&gt;
*Work with tools; DVCS (git), issue tracker (phabricator forge), docker&lt;br /&gt;
&lt;br /&gt;
=== Activity reports:===&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week (Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week  (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week (Coding)(Work Summary)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fifth Week  (First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Sixth Week   (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Seventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00011.html Week 29 Eight Week (Coding)(Work Summary)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00015.html Week 30 Nineth Week (Second Evaluation)]&lt;br /&gt;
* August 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00002.html Week 31 Tenth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00004.html Week 32 Eleventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 33 Twelfth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00011.html Week 34 Thirteenth Week (Final Evaluation)]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core/   Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category: Google Summer of Code]]&lt;br /&gt;
[[Category: Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1106</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1106"/>
		<updated>2019-08-24T17:58:48Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: /* Activity reports: */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Title:===&lt;br /&gt;
'''Increase archive coverage''' &lt;br /&gt;
&lt;br /&gt;
=== Description:===&lt;br /&gt;
As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. [https://docs.softwareheritage.org/devel/swh-lister/index.html#swh-lister Listers] are components that crawl the APIs of software forges (e.g., Bitbucket, GitHub, Sourceforge, ...) and return a list of the software available in it whereas [Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
&lt;br /&gt;
===Student: === &lt;br /&gt;
Archit Agrawal&lt;br /&gt;
* [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
&lt;br /&gt;
=== Mentors:===&lt;br /&gt;
* Nicolas Dandrimont&lt;br /&gt;
* Antoine R. Dumont&lt;br /&gt;
&lt;br /&gt;
===Work Done:===&lt;br /&gt;
* '''Listers:'''&lt;br /&gt;
** Completed and merged&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSfedfd73c8e4be8ce1d08b31c9a5cb99f9ca40fd6 Phabricator Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1482 GNU Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSa9a37a85bf9efac416cfdd152588bf01b7a063b2 CRAN Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1584 Packagist Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1610 CGit Lister]&lt;br /&gt;
** Did research on the methods that could be used to make following listers and made an implementation plan for the same&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1734 Launchpad Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1777 Rubygem Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1718 NuGET(.NET) Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1724 Maven Lister]&lt;br /&gt;
**  [https://forge.softwareheritage.org/rDLS08ade29e6de0616a3964360454ab52b58c082b75 Add tests to PyPI Lister]&lt;br /&gt;
** [https://forge.softwareheritage.org/rDLSf424f07c7e628eb7a19d25f4fdb749682d97a21f Refractor base tests for listers]&lt;br /&gt;
**  [https://forge.softwareheritage.org/D1441 Add documentation on *How to run a new lister*]&lt;br /&gt;
* '''Loaders:'''&lt;br /&gt;
** '''[https://forge.softwareheritage.org/T1389 Base Package Manager Loader]'''&lt;br /&gt;
*** Ingesting source code from package managers is a process somewhat similar for all of the package managers. This calls for a common base implementation for loading content from package managers into the archive. I worked on this idea, analysed the steps required to make a loader and the implementation of present package manager loader. Came up with the plan to implement the base loader and made the pass([https://forge.softwareheritage.org/D1694 D1694], [https://forge.softwareheritage.org/D1810 D1810], [https://forge.softwareheritage.org/D1811 D1811], [https://forge.softwareheritage.org/D1812 D1812], [https://forge.softwareheritage.org/D1813 D1813], [https://forge.softwareheritage.org/D1814 D1814], [https://forge.softwareheritage.org/D1744 D1744]). However, after the recommendation from my mentor, we changed the approach to make the base loader. Instead of making the whole base loader in one go, we decided to break it into multiple steps(3 steps) and follow the incremental approach.&lt;br /&gt;
**'''[https://forge.softwareheritage.org/D1824 GNU Loader]'''&lt;br /&gt;
*** As part of the first step towards the implementation of Base Loader, GNU Loader was implemented. &lt;br /&gt;
&lt;br /&gt;
===TO-DO:===&lt;br /&gt;
* Implement the Listers using the research done and the implementation plan made for Launchpad, Rubygem.&lt;br /&gt;
* Find the workarounds to solve the challenges in making the Maven and NuGET(.NET) Lister.&lt;br /&gt;
* Work on the remaining steps in order to complete the Base Package Manager Loader.&lt;br /&gt;
&lt;br /&gt;
=== Learnings: ===&lt;br /&gt;
Working in Software Heritage was a wholesome experience. I got to learn a new thing almost every day. It would me injustice id I say I can account all my learnings in a section of a blog, however here are a list of few of most prominent once: &lt;br /&gt;
*Working on a huge codebase&lt;br /&gt;
*Plan and design before jumping to code&lt;br /&gt;
*Writing clean and well-commented code&lt;br /&gt;
*Difference between doing projects in college and in the industry(Spoiler Alert: '''A lot''')&lt;br /&gt;
*Multiple language integration in a python library (Used in CRAN Lister)&lt;br /&gt;
*Different programming methodologies explained to me by my mentors(eg [https://en.wikipedia.org/wiki/Test-driven_development TDD])&lt;br /&gt;
*Working with git and forge&lt;br /&gt;
*Working with Docker&lt;br /&gt;
&lt;br /&gt;
=== Activity reports:===&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week (Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week  (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week (Coding)(Work Summary)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fifth Week  (First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Sixth Week   (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Seventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00011.html Week 29 Eight Week (Coding)(Work Summary)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00015.html Week 30 Nineth Week (Second Evaluation)]&lt;br /&gt;
* August 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00002.html Week 31 Tenth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00004.html Week 32 Eleventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 33 Twelfth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00011.html Week 34 Thirteenth Week (Final Evaluation)]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core/   Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category: Google Summer of Code]]&lt;br /&gt;
[[Category: Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1101</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1101"/>
		<updated>2019-08-23T13:18:07Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Title:===&lt;br /&gt;
'''Increase archive coverage''' &lt;br /&gt;
&lt;br /&gt;
=== Description:===&lt;br /&gt;
As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. [https://docs.softwareheritage.org/devel/swh-lister/index.html#swh-lister Listers] are components that crawl the APIs of software forges (e.g., Bitbucket, GitHub, Sourceforge, ...) and return a list of the software available in it whereas [Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
&lt;br /&gt;
===Student: === &lt;br /&gt;
Archit Agrawal&lt;br /&gt;
* [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
&lt;br /&gt;
=== Mentors:===&lt;br /&gt;
* Nicolas Dandrimont&lt;br /&gt;
* Antoine R. Dumont&lt;br /&gt;
&lt;br /&gt;
===Work Done:===&lt;br /&gt;
* '''Listers:'''&lt;br /&gt;
** Completed and merged&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSfedfd73c8e4be8ce1d08b31c9a5cb99f9ca40fd6 Phabricator Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1482 GNU Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSa9a37a85bf9efac416cfdd152588bf01b7a063b2 CRAN Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1584 Packagist Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1610 CGit Lister]&lt;br /&gt;
** Did research on the methods that could be used to make following listers and made an implementation plan for the same&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1734 Launchpad Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1777 Rubygem Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1718 NuGET(.NET) Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1724 Maven Lister]&lt;br /&gt;
**  [https://forge.softwareheritage.org/rDLS08ade29e6de0616a3964360454ab52b58c082b75 Add tests to PyPI Lister]&lt;br /&gt;
** [https://forge.softwareheritage.org/rDLSf424f07c7e628eb7a19d25f4fdb749682d97a21f Refractor base tests for listers]&lt;br /&gt;
**  [https://forge.softwareheritage.org/D1441 Add documentation on *How to run a new lister*]&lt;br /&gt;
* '''Loaders:'''&lt;br /&gt;
** '''[https://forge.softwareheritage.org/T1389 Base Package Manager Loader]'''&lt;br /&gt;
*** Ingesting source code from package managers is a process somewhat similar for all of the package managers. This calls for a common base implementation for loading content from package managers into the archive. I worked on this idea, analysed the steps required to make a loader and the implementation of present package manager loader. Came up with the plan to implement the base loader and made the pass([https://forge.softwareheritage.org/D1694 D1694], [https://forge.softwareheritage.org/D1810 D1810], [https://forge.softwareheritage.org/D1811 D1811], [https://forge.softwareheritage.org/D1812 D1812], [https://forge.softwareheritage.org/D1813 D1813], [https://forge.softwareheritage.org/D1814 D1814], [https://forge.softwareheritage.org/D1744 D1744]). However, after the recommendation from my mentor, we changed the approach to make the base loader. Instead of making the whole base loader in one go, we decided to break it into multiple steps(3 steps) and follow the incremental approach.&lt;br /&gt;
**'''[https://forge.softwareheritage.org/D1824 GNU Loader]'''&lt;br /&gt;
*** As part of the first step towards the implementation of Base Loader, GNU Loader was implemented. &lt;br /&gt;
&lt;br /&gt;
===TO-DO:===&lt;br /&gt;
* Implement the Listers using the research done and the implementation plan made for Launchpad, Rubygem.&lt;br /&gt;
* Find the workarounds to solve the challenges in making the Maven and NuGET(.NET) Lister.&lt;br /&gt;
* Work on the remaining steps in order to complete the Base Package Manager Loader.&lt;br /&gt;
&lt;br /&gt;
=== Learnings: ===&lt;br /&gt;
Working in Software Heritage was a wholesome experience. I got to learn a new thing almost every day. It would me injustice id I say I can account all my learnings in a section of a blog, however here are a list of few of most prominent once: &lt;br /&gt;
*Working on a huge codebase&lt;br /&gt;
*Plan and design before jumping to code&lt;br /&gt;
*Writing clean and well-commented code&lt;br /&gt;
*Difference between doing projects in college and in the industry(Spoiler Alert: '''A lot''')&lt;br /&gt;
*Multiple language integration in a python library (Used in CRAN Lister)&lt;br /&gt;
*Different programming methodologies explained to me by my mentors(eg [https://en.wikipedia.org/wiki/Test-driven_development TDD])&lt;br /&gt;
*Working with git and forge&lt;br /&gt;
*Working with Docker&lt;br /&gt;
&lt;br /&gt;
=== Activity reports:===&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week (Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week  (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fifth Week  (First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Sixth Week   (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Seventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00011.html Week 29 Eight Week (Coding)(Work Summary)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00015.html Week 30 Nineth Week (Second Evaluation)]&lt;br /&gt;
* August 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00002.html Week 31 Tenth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00004.html Week 32 Eleventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 33 Twelfth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 34 Thirteenth Week (Final Evaluation)]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core/   Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category: Google Summer of Code]]&lt;br /&gt;
[[Category: Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1100</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1100"/>
		<updated>2019-08-23T13:08:55Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;===Title: === &lt;br /&gt;
Increase archive coverage &lt;br /&gt;
&lt;br /&gt;
=== Description:===&lt;br /&gt;
As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. [https://docs.softwareheritage.org/devel/swh-lister/index.html#swh-lister Listers] are components that crawl the APIs of software forges (e.g., Bitbucket, GitHub, Sourceforge, ...) and return a list of the software available in it whereas [Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
&lt;br /&gt;
===Student: === &lt;br /&gt;
Archit Agrawal&lt;br /&gt;
* [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
&lt;br /&gt;
=== Mentors:===&lt;br /&gt;
* Nicolas Dandrimont&lt;br /&gt;
* Antoine R. Dumont&lt;br /&gt;
&lt;br /&gt;
===Work Done:===&lt;br /&gt;
* '''Listers:'''&lt;br /&gt;
** Completed and merged&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSfedfd73c8e4be8ce1d08b31c9a5cb99f9ca40fd6 Phabricator Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1482 GNU Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSa9a37a85bf9efac416cfdd152588bf01b7a063b2 CRAN Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1584 Packagist Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/D1610 CGit Lister]&lt;br /&gt;
** Did research on the methods that could be used to make following listers and made an implementation plan for the same&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1734 Launchpad Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1777 Rubygem Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1718 NuGET(.NET) Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/T1724 Maven Lister]&lt;br /&gt;
**  [https://forge.softwareheritage.org/rDLS08ade29e6de0616a3964360454ab52b58c082b75 Add tests to PyPI Lister]&lt;br /&gt;
** [https://forge.softwareheritage.org/rDLSf424f07c7e628eb7a19d25f4fdb749682d97a21f Refractor base tests for listers]&lt;br /&gt;
**  [https://forge.softwareheritage.org/D1441 Add documentation on *How to run a new lister*]&lt;br /&gt;
* '''Loaders:'''&lt;br /&gt;
** '''[https://forge.softwareheritage.org/T1389 Base Package Manager Loader]'''&lt;br /&gt;
*** Ingesting source code from package managers is a process somewhat similar for all of the package managers. This calls for a common base implementation for loading content from package managers into the archive. I worked on this idea, analysed the steps required to make a loader and the implementation of present package manager loader. Came up with the plan to implement the base loader and made the pass([https://forge.softwareheritage.org/D1694 D1694], [https://forge.softwareheritage.org/D1810 D1810], [https://forge.softwareheritage.org/D1811 D1811], [https://forge.softwareheritage.org/D1812 D1812], [https://forge.softwareheritage.org/D1813 D1813], [https://forge.softwareheritage.org/D1814 D1814], [https://forge.softwareheritage.org/D1744 D1744]). However, after the recommendation from my mentor, we changed the approach to make the base loader. Instead of making the whole base loader in one go, we decided to break it into multiple steps(3 steps) and follow the incremental approach.&lt;br /&gt;
**'''[https://forge.softwareheritage.org/D1824 GNU Loader]'''&lt;br /&gt;
*** As part of the first step towards the implementation of Base Loader, GNU Loader was implemented. &lt;br /&gt;
&lt;br /&gt;
===TO-DO:===&lt;br /&gt;
* Implement the Listers using the research done and the implementation plan made for Launchpad, Rubygem.&lt;br /&gt;
* Find the workarounds to solve the challenges in making the Maven and NuGET(.NET) Lister.&lt;br /&gt;
* Work on the remaining steps in order to complete the Base Package Manager Loader.&lt;br /&gt;
&lt;br /&gt;
=== Learnings: ===&lt;br /&gt;
Working in Software Heritage was a wholesome experience. I got to learn a new thing almost every day. It would me injustice id I say I can account all my learnings in a section of a blog, however here are a list of few of most prominent once: &lt;br /&gt;
*Working on a huge codebase&lt;br /&gt;
*Plan and design before jumping to code&lt;br /&gt;
*Writing clean and well-commented code&lt;br /&gt;
*Difference between doing projects in college and in the industry(Spoiler Alert: '''A lot''')&lt;br /&gt;
*Multiple language integration in a python library (Used in CRAN Lister)&lt;br /&gt;
*Different programming methodologies explained to me by my mentors(eg [https://en.wikipedia.org/wiki/Test-driven_development TDD])&lt;br /&gt;
*Working on git and forge&lt;br /&gt;
*Docker&lt;br /&gt;
===Experience:===&lt;br /&gt;
&lt;br /&gt;
=== Activity reports:===&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week (Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week  (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fifth Week  (First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Sixth Week   (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Seventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00011.html Week 29 Eight Week (Coding)(Work Summary)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00015.html Week 30 Nineth Week (Second Evaluation)]&lt;br /&gt;
* August 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00002.html Week 31 Tenth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00004.html Week 32 Eleventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 33 Twelfth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 34 Thirteenth Week (Final Evaluation)]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core/   Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category: Google Summer of Code]]&lt;br /&gt;
[[Category: Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1094</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1094"/>
		<updated>2019-08-21T17:58:04Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Experience:'''&lt;br /&gt;
* '''What I did:'''&lt;br /&gt;
** '''Listers:'''&lt;br /&gt;
*** Completed and merged&lt;br /&gt;
**** [https://forge.softwareheritage.org/rDLSfedfd73c8e4be8ce1d08b31c9a5cb99f9ca40fd6 Phabricator Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/D1482 GNU Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/rDLSa9a37a85bf9efac416cfdd152588bf01b7a063b2 CRAN Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/D1584 Packagist Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/D1610 CGit Lister]&lt;br /&gt;
*** Did research on the methods that could be used to make following listers and made an implementation plan for the same&lt;br /&gt;
**** [https://forge.softwareheritage.org/T1734 Launchpad Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/T1777 Rubygem Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/T1718 NuGET(.NET) Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/T1724 Maven Lister]&lt;br /&gt;
***  [https://forge.softwareheritage.org/rDLS08ade29e6de0616a3964360454ab52b58c082b75 Add tests to PyPI Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSf424f07c7e628eb7a19d25f4fdb749682d97a21f Refractor base tests for listers]&lt;br /&gt;
***[https://forge.softwareheritage.org/D1441 Add documentation on *How to run a new lister*]&lt;br /&gt;
** '''Loaders:'''&lt;br /&gt;
*** Base Loader&lt;br /&gt;
*** GNU Loader&lt;br /&gt;
&lt;br /&gt;
* '''Learnings:'''&lt;br /&gt;
**How to work on a huge codebase&lt;br /&gt;
**How to plan and design before jumping to code&lt;br /&gt;
**Writing clean and well-commented code&lt;br /&gt;
**How are projects are made in the industry different from making projects in college(Spoiler Alert: '''A lot''')&lt;br /&gt;
**Multiple language integration in a python library (Used in CRAN Lister)&lt;br /&gt;
**Different programming methodologies explained to me by my mentors(eg [https://en.wikipedia.org/wiki/Test-driven_development TDD])&lt;br /&gt;
**Working on git and forge&lt;br /&gt;
**Docker&lt;br /&gt;
&lt;br /&gt;
* '''TO-DO:'''&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week (Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week  (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fifth Week  (First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Sixth Week   (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Seventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00011.html Week 29 Eight Week (Coding)(Work Summary)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00015.html Week 30 Nineth Week (Second Evaluation)]&lt;br /&gt;
* August 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00002.html Week 31 Tenth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00004.html Week 32 Eleventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 33 Twelfth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 34 Thirteenth Week (Final Evaluation)]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core/   Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category: Google Summer of Code]]&lt;br /&gt;
[[Category: Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1093</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1093"/>
		<updated>2019-08-21T15:04:44Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Experience:'''&lt;br /&gt;
* '''What I did:'''&lt;br /&gt;
** '''Listers:'''&lt;br /&gt;
*** Completed and merged&lt;br /&gt;
**** [https://forge.softwareheritage.org/rDLSfedfd73c8e4be8ce1d08b31c9a5cb99f9ca40fd6 Phabricator Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/D1482 GNU Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/rDLSa9a37a85bf9efac416cfdd152588bf01b7a063b2 CRAN Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/D1584 Packagist Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/D1610 CGit Lister]&lt;br /&gt;
***  [https://forge.softwareheritage.org/rDLS08ade29e6de0616a3964360454ab52b58c082b75 Add tests to PyPI Lister]&lt;br /&gt;
*** [https://forge.softwareheritage.org/rDLSf424f07c7e628eb7a19d25f4fdb749682d97a21f Refractor base tests for listers]&lt;br /&gt;
***[https://forge.softwareheritage.org/D1441 Add documentation on *How to run a new lister*]&lt;br /&gt;
*** Did research on the methods that could be used to make following listers and made an implementation plan for the same:&lt;br /&gt;
**** [https://forge.softwareheritage.org/T1734 Launchpad Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/T1777 Rubygem Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/T1718 NuGET(.NET) Lister]&lt;br /&gt;
**** [https://forge.softwareheritage.org/T1724 Maven Lister]&lt;br /&gt;
** '''Loaders:'''&lt;br /&gt;
*** Base Loader&lt;br /&gt;
*** GNU Loader&lt;br /&gt;
&lt;br /&gt;
*'''Challenges:'''&lt;br /&gt;
* '''Learnings:'''&lt;br /&gt;
* '''TO-DO:'''&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week (Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week  (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fifth Week  (First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Sixth Week   (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Seventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00011.html Week 29 Eight Week (Coding)(Work Summary)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00015.html Week 30 Nineth Week (Second Evaluation)]&lt;br /&gt;
* August 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00002.html Week 31 Tenth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00004.html Week 32 Eleventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 33 Twelfth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 34 Thirteenth Week (Final Evaluation)]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core//    Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category: Google Summer of Code]]&lt;br /&gt;
[[Category: Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1092</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1092"/>
		<updated>2019-08-21T09:49:59Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Experience:'''&lt;br /&gt;
* '''What I did:'''&lt;br /&gt;
*'''Challenges:'''&lt;br /&gt;
* '''Learnings:'''&lt;br /&gt;
* '''TO-DO:'''&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week (Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week  (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fifth Week  (First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Sixth Week   (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Seventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00011.html Week 29 Eight Week (Coding)(Work Summary)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00015.html Week 30 Nineth Week (Second Evaluation)]&lt;br /&gt;
* August 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00002.html Week 31 Tenth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00004.html Week 32 Eleventh Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 33 Twelfth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-08/msg00008.html Week 34 Thirteenth Week (Final Evaluation)]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core//    Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category: Google Summer of Code]]&lt;br /&gt;
[[Category: Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1088</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1088"/>
		<updated>2019-08-18T16:38:57Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: /* General information */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Experience:'''&lt;br /&gt;
* '''What I did:'''&lt;br /&gt;
*'''Challenges:'''&lt;br /&gt;
* '''Learnings:'''&lt;br /&gt;
* '''TO-DO:'''&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week (Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week (Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week  (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fifth Week  (First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Sixth Week   (Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Seventh Week (Coding)]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core//    Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category: Google Summer of Code]]&lt;br /&gt;
[[Category: Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1080</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1080"/>
		<updated>2019-07-19T20:08:54Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week(Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fifth Week(First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Sixth Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Seventh Week(Coding)]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core//    Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Google Summer of Code]]&lt;br /&gt;
[[Category:Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1079</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1079"/>
		<updated>2019-07-19T19:44:45Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week(Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fourth Week(First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Fifth Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Sixth Week(Coding)]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core//    Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Google Summer of Code]]&lt;br /&gt;
[[Category:Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1078</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1078"/>
		<updated>2019-07-19T19:44:10Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week(Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00033.html Week 26 Fourth Week(First Evaluation)]&lt;br /&gt;
* July 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00003.html Week 27 Fourth Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-07/msg00006.html Week 28 Fourth Week(Coding)]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-loader-core//    Loader source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Google Summer of Code]]&lt;br /&gt;
[[Category:Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1062</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1062"/>
		<updated>2019-06-22T08:16:46Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
* May 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week(Coding)]&lt;br /&gt;
* June 2019&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00016.html Week 24 Third Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00026.html Week 25 Fourth Week(Coding)]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Google Summer of Code]]&lt;br /&gt;
[[Category:Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1059</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1059"/>
		<updated>2019-06-14T13:24:26Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00017.html Week 22 First Week(Coding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-06/msg00009.html Week 23 Second Week(Coding)]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Google Summer of Code]]&lt;br /&gt;
[[Category:Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1050</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1050"/>
		<updated>2019-05-29T11:00:37Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Third Week(Community Bonding)]&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Google Summer of Code]]&lt;br /&gt;
[[Category:Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1049</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1049"/>
		<updated>2019-05-29T11:00:09Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Week 20 Second Week(Community Bonding)]&lt;br /&gt;
** [https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00010.html Week 21 Second Week(Community Bonding)]&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Google Summer of Code]]&lt;br /&gt;
[[Category:Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Suggestion_box:_source_code_to_add&amp;diff=1048</id>
		<title>Suggestion box: source code to add</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Suggestion_box:_source_code_to_add&amp;diff=1048"/>
		<updated>2019-05-29T10:53:53Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;The [[Archive]] is growing organically. We started &amp;quot;small&amp;quot;, tracking 3 '''software origins''' (GitHub + Debian + GNU), and we will be adding new origins bit by bit, depending on the urgency of archiving them and available development energies to integrate them into Software Heritage.&lt;br /&gt;
&lt;br /&gt;
Using this page you can add suggestions of software origins that we aren't following yet, but we should. You can include information about who to contact for technical collaboration, the urgency of archival, and other useful information. To that end, just add a row to the table below. Here some information about the meaning of the various columns.&lt;br /&gt;
&lt;br /&gt;
Entries are currently listed simply in order of addition to this page; we will add more structure when the list will start growing.&lt;br /&gt;
&lt;br /&gt;
=== Legend ===&lt;br /&gt;
&lt;br /&gt;
;Software origin&lt;br /&gt;
: any (public accessible) &amp;quot;place&amp;quot; on the Internet that host software in source code form. Please provide a title for it and hyperlink it to the relevant URL&lt;br /&gt;
;Type of origin&lt;br /&gt;
: information about the kind of hosting, e.g., whether it is a forge, a collection of repositories, an homepage publishing tarball, or a one shot source code repository. For all kind of repositories please specify which VCS system is in use (Git, SVN, CVS, etc.)&lt;br /&gt;
;Contact&lt;br /&gt;
: who to contact for technical collaboration on how to best archive source code hosted on the software origin. You can list yourself if you're the relevant person, or provide the most relevant contact point if you know it&lt;br /&gt;
;Conservation status&lt;br /&gt;
: information about how likely it is that the software origin will disappear; high likelihood will make it more urgent for us to archive software hosted there. We suggest to use the [https://en.wikipedia.org/wiki/Conservation_status species conservation status], i.e., one of: Critically endangered (CR), Endangered (EN), Vulnerable (VU), Near threatened (NT), Least concern (LC).&lt;br /&gt;
;How to mirror&lt;br /&gt;
: (pointers to) technical information on how to do a full mirror of ''all'' the source code available at the software origin, ideally one shot and in batch&lt;br /&gt;
;How to keep up&lt;br /&gt;
: (pointers to) technical information on how to incrementally retrieve new source code accumulated since the last visit; usually this should be based on some kind of incremental change feed or event API&lt;br /&gt;
;Notes&lt;br /&gt;
: anything else you think we should know about this software origin&lt;br /&gt;
;Forge task&lt;br /&gt;
: pointer to the task on our [[forge]] tracking the work needed to ingest the software origin&lt;br /&gt;
&lt;br /&gt;
== Suggestions ==&lt;br /&gt;
&lt;br /&gt;
* https://notabug.org/ (customized gogs)&lt;br /&gt;
* https://gitgud.io/ (GitLab, run by Sapphire, a user-supported open source company)&lt;br /&gt;
* https://octo.sh/ (GitLab)&lt;br /&gt;
* https://chaos.expert/explore (GitLab by Chaos Computer Club)&lt;br /&gt;
* https://gitlab.coko.foundation/public (GitLab)&lt;br /&gt;
* https://git.teknik.io/explore/repos (Gitea)&lt;br /&gt;
* https://gitlab.gnome.org/explore/groups (GNOME software)&lt;br /&gt;
* https://launchpad.net/ (used by Ubuntu and others)&lt;br /&gt;
* https://archive.codeplex.com/ (was Microsoft's free, open source project hosting)&lt;br /&gt;
* https://wiki.p2pfoundation.net/List_of_Community-Hosted_GitLab_Instances&lt;br /&gt;
* https://0xacab.org/explore (Gitlab)&lt;br /&gt;
* https://git.fosscommunity.in/explore/projects (GitLab by Free Software Community of India)&lt;br /&gt;
&lt;br /&gt;
{| class=&amp;quot;wikitable&amp;quot;&lt;br /&gt;
!Software origin&lt;br /&gt;
!Type of origin&lt;br /&gt;
!Contact&lt;br /&gt;
!Conservation status&lt;br /&gt;
!How to mirror&lt;br /&gt;
!How to keep up&lt;br /&gt;
!Notes&lt;br /&gt;
![https://forge.softwareheritage.org/ Forge] task&lt;br /&gt;
|-&lt;br /&gt;
|'''(sample entry)''' ''GitHubBub forge''&lt;br /&gt;
|''Git hosting''&lt;br /&gt;
|''John Doe &amp;lt;john@example.com&amp;gt;''&lt;br /&gt;
|''LC''&lt;br /&gt;
|''retrieve full repo list at /api/list, then git clone on each entry''&lt;br /&gt;
|''poll RSS feed at /api/updates?since=YYYY-MM-DD''&lt;br /&gt;
|''nothing special to add''&lt;br /&gt;
|''[https://forge.softwareheritage.org/T123456 T123456]''&lt;br /&gt;
|-&lt;br /&gt;
|[https://bitbucket.org/ Bitbucket]&lt;br /&gt;
|Git and hg/Mercurial hosting&lt;br /&gt;
|&lt;br /&gt;
|style=&amp;quot;background-color: lightgreen&amp;quot;| LC&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|[https://forge.softwareheritage.org/T561 T561]&lt;br /&gt;
|-&lt;br /&gt;
|[https://sourceforge.net/ SourceForge]&lt;br /&gt;
|CVS, SVN, Mercurial, Git&lt;br /&gt;
|&lt;br /&gt;
|style=&amp;quot;background-color: orange&amp;quot;|VU&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://wiki.debian.org/Derivatives/Census all Debian derivatives]&lt;br /&gt;
|Debian-based distros&lt;br /&gt;
|Paul Wise &amp;lt;pabs@debian.org&amp;gt;&lt;br /&gt;
|varying, depending on the distro&lt;br /&gt;
|see [[Suggestion_box:_source_code_to_add/Debian_derivatives|details]]&lt;br /&gt;
|see [[Suggestion_box:_source_code_to_add/Debian_derivatives|details]]&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.gentoo.org/ Gentoo]&lt;br /&gt;
|&lt;br /&gt;
|Johannes Kellner &amp;lt;gentoo@johannes-kellner.eu&amp;gt;&lt;br /&gt;
|style=&amp;quot;background-color: lightgreen&amp;quot;|LC&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[http://pauillac.inria.fr/~huet/cea.html Gérard Huet's seminal work on 3D]&lt;br /&gt;
|Scanned source code&lt;br /&gt;
|Gérard Huet &amp;lt;gerard.huet@inria.fr&amp;gt;&lt;br /&gt;
|style=&amp;quot;background-color: red&amp;quot;|EN&lt;br /&gt;
|retrieve listing images from the web pages&lt;br /&gt;
|N/A&lt;br /&gt;
|links are half broken, yquem should be replaced with pauillac everywhere it appears&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[http://www.softwarepreservation.org/projects Software Preservation Project]&lt;br /&gt;
|Website with a collection of archives&lt;br /&gt;
|Paul McJones &amp;lt;paul@mcjones.org&amp;gt;&lt;br /&gt;
|style=&amp;quot;background-color: lightgreen&amp;quot;|LC&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://code.nasa.gov/ 253 NASA open source software projects]&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|style=&amp;quot;background-color: lightgreen&amp;quot;|LC&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[http://smaky.ch/ Smaky], the swiss micro-computer series&lt;br /&gt;
|[http://infini.smaky.ch/sources.html Partial code dump]&lt;br /&gt;
|[mailto:arnaud@epsitec.ch Pierre Arnaud] (current CEO of Epsitec) and/or [mailto:jean-daniel.nicoud@epfl.ch Jean-Daniel Nicoud] (founder of the computer series]&lt;br /&gt;
|style=&amp;quot;background-color: red&amp;quot;|EN&lt;br /&gt;
|Probably manually&lt;br /&gt;
|No new updates&lt;br /&gt;
|Some references to this history: [http://www.memoires-informatiques.org/ Fondation Mémoires Informatiques], [http://smaky.ch/ Smaky.ch] (in particular, [http://smaky.ch/theme.php?id=lami the short history]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.wikidata.org/wiki/Wikidata:WikiProject_Informatics/FLOSS#Conservation_status_2 wikidata endangered software]&lt;br /&gt;
|depends on the &amp;quot;source code repository&amp;quot; property&lt;br /&gt;
|Loic Dachary &amp;lt;loic@dachary.org&amp;gt;&lt;br /&gt;
|style=&amp;quot;background-color: yellow&amp;quot;|The risk is higher than [https://www.wikidata.org/wiki/Property_talk:P141 LC]&lt;br /&gt;
|A script should obtain the &amp;quot;source code repository&amp;quot; property for the software and mirror it depending on the [https://www.wikidata.org/wiki/Wikidata:WikiProject_Informatics/FLOSS#source_code_repository protocol] qualifier. If the &amp;quot;source code repository&amp;quot; is &amp;quot;no value&amp;quot;, the [https://www.wikidata.org/wiki/Wikidata:WikiProject_Informatics/Software#streaming_media_URL streaming media URL] of the &amp;quot;preferred&amp;quot; [https://www.wikidata.org/wiki/Wikidata:WikiProject_Informatics/Software#software_version_.28P348.29 software version] should be downloaded instead.&lt;br /&gt;
|Once a copy is secured by software heritage, a URL to the software heritage repository should be added to the &amp;quot;source code repository&amp;quot; property and the &amp;quot;conservation status&amp;quot; property should be removed, meaning it is &amp;quot;least concerned&amp;quot; by default. The software will no longer show in the list of endangered software.&lt;br /&gt;
|This is work in progress, part of the [https://www.wikidata.org/wiki/Wikidata:WikiProject_Informatics/FLOSS wikidata FLOSS project] and the scripts do not exist yet.&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|historical KDE repos&lt;br /&gt;
|CVS/SVN/Git&lt;br /&gt;
|KDE sysadmin team &amp;lt;sysadmin@kde.org&amp;gt;&lt;br /&gt;
|style=&amp;quot;background-color: yellow&amp;quot;|NT&lt;br /&gt;
|See [[Suggestion box: source code to add/KDE|details]]&lt;br /&gt;
|See [[Suggestion box: source code to add/KDE|details]]&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://java.net/projects Java.net] &amp;amp; [https://kenai.com/ Kenai.com]&lt;br /&gt;
|hg, git, svn&lt;br /&gt;
|communitymanager@java.net&lt;br /&gt;
|style=&amp;quot;background-color: black; color: white&amp;quot;|CR&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|[https://community.oracle.com/community/java/javanet-forge-sunset Shutting down on April 28, 2017]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://fedorahosted.org/ fedorahosted.org]&lt;br /&gt;
|git, svn, hg, bzr&lt;br /&gt;
|[https://lists.fedoraproject.org/admin/lists/infrastructure@lists.fedoraproject.org infrastructure@lists.fedoraproject.org]&lt;br /&gt;
|style=&amp;quot;background-color: black; color: white&amp;quot;|CR&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|[https://communityblog.fedoraproject.org/fedorahosted-sunset-2017-02-28/ Shutting down on Feb. 28, 2017]&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[http://www.societe-informatique-de-france.fr/wp-content/uploads/2015/12/1024-no7-Baude.pdf Langage symbolique d'Enseignement (LSE)]&lt;br /&gt;
|archives&lt;br /&gt;
|Association Enseignement Public et Informatique (EPI) &amp;lt;bureau@epi.asso.fr&amp;gt;&lt;br /&gt;
|style=&amp;quot;background-color: black; color: white&amp;quot;|CR&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
| + educational software (INRP-CNDP)&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[http://www.netlib.org The Netlib collection of numerical software]&lt;br /&gt;
|structured website with links to archives&lt;br /&gt;
|&lt;br /&gt;
|style=&amp;quot;background-color: lightgreen&amp;quot;|LC&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|many of these libraries are mirrored in sources already collected in Software Heritage; there is sure value in the curation information.&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://codebender.cc/ codebender]&lt;br /&gt;
|IOT and educationnal ressources&lt;br /&gt;
|&lt;br /&gt;
|style=&amp;quot;background-color: black; color: white&amp;quot;|CR&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|closing expected at Dec31st, 2016 upon this announce by founders https://codebender.cc/next-chapter&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://git.oschina.net/ OS China]&lt;br /&gt;
|Chinese GitHub equivalent&lt;br /&gt;
|&lt;br /&gt;
|style=&amp;quot;background-color: lightgreen&amp;quot;|LC&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|interesting test case for all the unicode tooling in Software Heritage&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Usenet source code archives&lt;br /&gt;
|NNTP&lt;br /&gt;
|&lt;br /&gt;
|style=&amp;quot;background-color: yellow&amp;quot;|NT&lt;br /&gt;
|crawl relevant newsgroup archives (e.g., at Google Groups), retrieve archives (possibly chunked), ingest&lt;br /&gt;
|one shot might be enough?&lt;br /&gt;
|suggestion by John Gilmore&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.x.org/releases/ X11/XOrg archives]&lt;br /&gt;
|http&lt;br /&gt;
|&lt;br /&gt;
|style=&amp;quot;background-color: lightgreen&amp;quot;|LC&lt;br /&gt;
|&lt;br /&gt;
|one shot might be enough&lt;br /&gt;
|&lt;br /&gt;
||''[https://forge.softwareheritage.org/T1774 T1774]''&lt;br /&gt;
|-&lt;br /&gt;
|[https://www.curseforge.com/ Curse mods]&lt;br /&gt;
|Code distributed as versioned tarballs&lt;br /&gt;
|&lt;br /&gt;
|style=&amp;quot;background-color: lightgreen&amp;quot;|LC&lt;br /&gt;
|The API should be sufficient, maybe some scraping will be required&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|Symbian source code&lt;br /&gt;
|HG&lt;br /&gt;
|carlo.daffara@nodeweaver.eu&lt;br /&gt;
|style=&amp;quot;background-color: orange&amp;quot;|VU&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|Cloned from Nokia's Symbian Mercurial repository, a few days before the closure of the repo and the change to a proprietary license. The mercurial HG files are the only copy publicly available; I have made a snapshot of the code and placed on sourceforge- but missing is the entire project history and commit log.&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://gist.github.com/ GitHub's gists]&lt;br /&gt;
|git&lt;br /&gt;
|&lt;br /&gt;
|style=&amp;quot;background-color: lightgreen&amp;quot;|LC&lt;br /&gt;
|No proper listing API, but there's [https://developer.github.com/v3/gists/#list-all-public-gists an endpoint] to get gists created after a given date&lt;br /&gt;
|use the same endpoint&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://codeberg.org/ Codeberg]&lt;br /&gt;
|git&lt;br /&gt;
|contact@codeberg.org&lt;br /&gt;
|style=&amp;quot;background-color: lightgreen&amp;quot;|LC&lt;br /&gt;
|[https://github.com/go-gitea Gitea] API&lt;br /&gt;
|[https://github.com/go-gitea Gitea] API&lt;br /&gt;
|Codeberg e.V. is a Non-Profit Collaboration Community for Free and Open Source Projects&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
|[https://puszcza.gnu.org.ua/ Puszcza]&lt;br /&gt;
|cvs/git/hg/svn, VCS snapshot tarballs, tarballs&lt;br /&gt;
|https://puszcza.gnu.org.ua/contact.php&lt;br /&gt;
|style=&amp;quot;background-color: lightgreen&amp;quot;|LC&lt;br /&gt;
|Savane instance, https://download.gnu.org.ua/ (also has ftp), http://git.gnu.org.ua/&lt;br /&gt;
|Savane instance&lt;br /&gt;
|&lt;br /&gt;
|&lt;br /&gt;
|-&lt;br /&gt;
&lt;br /&gt;
[[Category:Archive]]&lt;br /&gt;
[[Category:Suggestions]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1046</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1046"/>
		<updated>2019-05-24T04:02:53Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
** [[https://sympa.inria.fr/sympa/arc/swh-devel/2019-05/msg00003.html Second Week(Community Bonding)]]&lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Google Summer of Code]]&lt;br /&gt;
[[Category:Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1042</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1042"/>
		<updated>2019-05-15T15:06:37Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ Forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
** &lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [https://forge.softwareheritage.org/source/swh-lister/    Lister source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Google Summer of Code]]&lt;br /&gt;
[[Category:Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1041</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1041"/>
		<updated>2019-05-15T15:03:18Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code, one of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository ...) and load it into Software Heritage, by adapting it so that it matches the archive data model. The goal of this project is to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations.&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
** &lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [http://example.com source code repository]&lt;br /&gt;
* see project [https://summerofcode.withgoogle.com/projects/#5658995887439872 on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Google Summer of Code]]&lt;br /&gt;
[[Category:Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1040</id>
		<title>Google Summer of Code 2019/Increase archive coverage</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Summer_of_Code_2019/Increase_archive_coverage&amp;diff=1040"/>
		<updated>2019-05-15T14:56:03Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: &lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;* '''Title:''' Increase archive coverage&lt;br /&gt;
* '''Description:''' As Software Heritage works on archiving and sharing source code. One of the major tasks is to ingest the latest source code available in the database from time to time and from all the possible sources where you can fetch the source code using listers and ingest them using loaders. Listers are components that crawl the APIs of software forges (e.g., Bitbucket, Gitorious, Sourceforge, ...) and return a list of the software available in it whereas Loaders take a bundle of software (tarball, Git repository, Python package, ...) and load it into Software Heritage, by adapting it so that it matches the archive data model.&lt;br /&gt;
&lt;br /&gt;
So the goal of this project is to make specialised softwares which are called Listers and Loaders&lt;br /&gt;
The goal of this  to increase the archive coverage by making listers and loaders for different websites that which stores source code, so that Software Heritage can fetch as much source code as possible and store it in the database to preserve it for future generations&lt;br /&gt;
* '''Student:''' Archit Agrawal&lt;br /&gt;
** [https://forge.softwareheritage.org/p/nahimilega/ forge activity]&lt;br /&gt;
* '''Mentors:'''&lt;br /&gt;
** Nicolas Dandrimont&lt;br /&gt;
** Antoine R. Dumont&lt;br /&gt;
* '''Activity reports:'''&lt;br /&gt;
** &lt;br /&gt;
&lt;br /&gt;
== Links ==&lt;br /&gt;
&lt;br /&gt;
* [http://example.com source code repository]&lt;br /&gt;
* see project [http://example.com on the GSoC portal]&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
[[Category:Google Summer of Code]]&lt;br /&gt;
[[Category:Google Summer of Code 2019]]&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Season_of_Docs_2019&amp;diff=1029</id>
		<title>Google Season of Docs 2019</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Season_of_Docs_2019&amp;diff=1029"/>
		<updated>2019-05-13T20:28:09Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: /* General information */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[File:GSoDLogo.png|800px]]&lt;br /&gt;
&lt;br /&gt;
== General information ==&lt;br /&gt;
&lt;br /&gt;
This page is the central point of information for [[Software Heritage]] participation into the [https://developers.google.com/season-of-docs/ Google Season of Docs] program.&lt;br /&gt;
&lt;br /&gt;
Google Season of Docs is a program where Google pays technical writers stipends to work on free software projects such as Software Heritage. Each writer works with mentors from the community to complete a documentation project.&lt;br /&gt;
&lt;br /&gt;
== I want to participate as a technical writer ==&lt;br /&gt;
&lt;br /&gt;
Great!, we are very glad for your interest in contributing to Software Heritage and we are looking forward to work together.&lt;br /&gt;
&lt;br /&gt;
=== Prerequisites ===&lt;br /&gt;
&lt;br /&gt;
The following prerequisites apply to Software Heritage GSoD projects:&lt;br /&gt;
&lt;br /&gt;
* [http://www.sphinx-doc.org/ Sphinx] is our documentation system of choice, you should be familiar with it to apply. In particular, we generally use reStructuredText markup and (for API references) Python docstrings with the [https://www.sphinx-doc.org/en/master/usage/extensions/napoleon.html Napoleon style]&lt;br /&gt;
* [https://git-scm.com Git] is our version control system of choice, you should be familiar with it to apply&lt;br /&gt;
* additional prerequisites depend on the project you will work on; check project descriptions for details&lt;br /&gt;
&lt;br /&gt;
=== Before you apply ===&lt;br /&gt;
&lt;br /&gt;
Here are the steps you should follow before applying, to make sure you have a general idea of the current state of Software Heritage technical documentation:&lt;br /&gt;
&lt;br /&gt;
* Learn about our project via our [https://www.softwareheritage.org/ main website] and the actual [https://archive.softwareheritage.org/ source code archive]&lt;br /&gt;
* Check out the index of our [https://www.softwareheritage.org/community/developers/ resources for developers]&lt;br /&gt;
* In particular make sure to have a look at:&lt;br /&gt;
** our [https://wiki.softwareheritage.org/ public wiki]&lt;br /&gt;
** our [https://docs.softwareheritage.org/devel/ documentation index]&lt;br /&gt;
* Technical setup:&lt;br /&gt;
** Create an account on our [https://forge.softwareheritage.org development forge]&lt;br /&gt;
** Familiarize yourself with our [[Code review in Phabricator|code review workflow]]&lt;br /&gt;
** Make a simple change to the documentation of any one of our [https://docs.softwareheritage.org/devel/ software components] and submit it as a [https://forge.softwareheritage.org/differential/ diff] for code review, following the above workflow. Feel free to submit any patch you think it might be useful.&lt;br /&gt;
&lt;br /&gt;
=== What to include in your application ===&lt;br /&gt;
&lt;br /&gt;
Make sure that your application includes the following information:&lt;br /&gt;
&lt;br /&gt;
* Describe the '''specific project''' you want to work on. What do you want to achieve? Why is it important? Why is it useful for Software Heritage? The project might be one of the project ideas that we have prepared below, or something else entirely that you want to contribute to Software Heritage. Your pet peeve, surprise us!&lt;br /&gt;
* Detail your '''work plan''': a brief description of how you plan to go about your project, including a list of  ''deliverables'' and a ''timeline'' of when do you expect them to be available.&lt;br /&gt;
* Include a reference to '''the diff''' you submitted before applying (see the &amp;quot;Before you apply&amp;quot; section above).&lt;br /&gt;
&lt;br /&gt;
== Ideas list ==&lt;br /&gt;
&lt;br /&gt;
Below you can find a list of project ideas that are good options for a&lt;br /&gt;
reasonably sized GSoD project.  They are just suggestion though, don't feel&lt;br /&gt;
obliged to pick one of them if there is nothing that fits your taste and&lt;br /&gt;
abilities.  Feel free to propose something else that you are excited about and&lt;br /&gt;
that contributes to improve the Software Heritage documentation: we will be&lt;br /&gt;
happy to consider it!&lt;br /&gt;
&lt;br /&gt;
=== Reorganize developer documentation as tutorials/how-to/discussions/references ===&lt;br /&gt;
&lt;br /&gt;
Our [https://docs.softwareheritage.org/devel/ developer documentation] is not particularly structured, making it hard to use it properly.&lt;br /&gt;
We would like to reorganize it following the [https://www.youtube.com/watch?v=t4vKPhjcMZg tutorials/how-to/discussions/references taxonomy], or something equally sensible.&lt;br /&gt;
&lt;br /&gt;
This project will require discussing the intended use cases of this page, conceptually structure the future version of it, implement it, and (re)write suitable language to introduce the various parts.&lt;br /&gt;
&lt;br /&gt;
=== Write a high-level technical overview of the project, data model, and data flow ===&lt;br /&gt;
&lt;br /&gt;
We have described the data model and archival data flow of Software Heritage in [https://upsilon.cc/~zack/research/publications/ipres-2018-doi.pdf various] [https://upsilon.cc/~zack/research/publications/cacm-2018-software-heritage.pdf scientific] [https://upsilon.cc/~zack/research/publications/msr-2019-swh.pdf papers], but haven't really worked on a general, high-level, technical presentation of it that targets developers.&lt;br /&gt;
The current [https://docs.softwareheritage.org/devel/swh-model/data-model.html#data-model data model documentation] has been copy-pasted from scientific papers.&lt;br /&gt;
We want to revisit it, to make sure the language description is suitable for a more general technical public, possibly complementing it with more abstract data-structure descriptions and cross-references to the code implementing the model and the workflow in the actual implementation.&lt;br /&gt;
&lt;br /&gt;
=== Design and document consistent writing conventions for Python docstrings ===&lt;br /&gt;
&lt;br /&gt;
The docstrings of the [https://docs.softwareheritage.org/devel/#components various software components] in the Software Heritage stack are not very consistent.&lt;br /&gt;
They have been written by a number of different developers, with varying degrees of English proficiency.&lt;br /&gt;
&lt;br /&gt;
We would like to document, as a set of writing guidelines, how to write them to the benefits of current and future developers.&lt;br /&gt;
The guidelines should include what to write in docstrings and whatnot, writing styles and guidance, examples of good and bad content for them, etc.&lt;br /&gt;
&lt;br /&gt;
To the extent it is possible, this project will also explore automating checks for guidelines conformance, in the form of basic structural (e.g., are all function parameters documented? is the function name correct? etc.) and writing checks (e.g., spellchecking, grammar checking, etc.).&lt;br /&gt;
&lt;br /&gt;
=== Revamp new (code) contributor tutorial ===&lt;br /&gt;
&lt;br /&gt;
The current [https://docs.softwareheritage.org/devel/#getting-started getting started documentation] include two overlapping documents: one about how to ''run'' a local instance of the full Software Heritage stack, another about how to create a local setup for current and future project ''developers''.&lt;br /&gt;
We would like to review those documents for consistency and, more generally, revamp documentation that will help wannabe code contributors to quickly get started and submit patches.&lt;br /&gt;
&lt;br /&gt;
As part of this project we might also want to revamp glossary, data model and other cross-cutting documents that help newcomers understand where-is-what in the code base and how to practically prepare and send us patches.&lt;br /&gt;
&lt;br /&gt;
=== Restructure wiki landing page as main community entry point ===&lt;br /&gt;
&lt;br /&gt;
Our [[Main_Page|public wiki landing page]] is, well, a mess.&lt;br /&gt;
It has grown inorganically by accumulating links to pages used more as working drafts than reasonably structured documents.&lt;br /&gt;
We would like to restructure it to cater for various use cases (users, developers, students, curators, scientists, etc.) in the attempt of making it the primary entry point into Software Heritage for all community members.&lt;br /&gt;
&lt;br /&gt;
This project will require discussing the various uses cases, design a suitable page layout, implement it in the wiki, write the introductory parts for each path into the wiki.&lt;br /&gt;
A related task, if time permits, will be to reorganize the wiki [[Special:Categories|category taxonomy]] and document when/how to use the available categories in the future.&lt;br /&gt;
&lt;br /&gt;
Working on this project will require familiarity with [https://www.mediawiki.org/wiki/MediaWiki Mediawiki] and its markup language.&lt;br /&gt;
&lt;br /&gt;
== Contact ==&lt;br /&gt;
&lt;br /&gt;
GSoD applicants are encouraged to get in touch with the Software Heritage community using the standard development communication channels, i.e.:&lt;br /&gt;
&lt;br /&gt;
* the #swh-devel IRC channel on [https://freenode.net Freenode]&lt;br /&gt;
* the [https://sympa.inria.fr/sympa/info/swh-devel swh-devel mailing list]&lt;br /&gt;
&lt;br /&gt;
See our [https://www.softwareheritage.org/community/developers/ development information page] for more details.&lt;br /&gt;
&lt;br /&gt;
== Timeline ==&lt;br /&gt;
&lt;br /&gt;
See the official [https://developers.google.com/season-of-docs/docs/timeline Google Season of Docs timeline].&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
	<entry>
		<id>https://wiki.softwareheritage.org/index.php?title=Google_Season_of_Docs_2019&amp;diff=1028</id>
		<title>Google Season of Docs 2019</title>
		<link rel="alternate" type="text/html" href="https://wiki.softwareheritage.org/index.php?title=Google_Season_of_Docs_2019&amp;diff=1028"/>
		<updated>2019-05-13T20:22:24Z</updated>

		<summary type="html">&lt;p&gt;Archit agrawal: /* General information */&lt;/p&gt;
&lt;hr /&gt;
&lt;div&gt;[[File:GSoDLogo.png|800px]]&lt;br /&gt;
&lt;br /&gt;
== General information ==&lt;br /&gt;
&lt;br /&gt;
This page is the central point of information for [[Software Heritage]] participation into the [https://developers.google.com/season-of-docs/ Google Season of Docs] program.&lt;br /&gt;
&lt;br /&gt;
Google Summer of Docs is a program where Google pays technical writers stipends to work on free software projects such as Software Heritage. Each writer works with mentors from the community to complete a documentation project.&lt;br /&gt;
&lt;br /&gt;
== I want to participate as a technical writer ==&lt;br /&gt;
&lt;br /&gt;
Great!, we are very glad for your interest in contributing to Software Heritage and we are looking forward to work together.&lt;br /&gt;
&lt;br /&gt;
=== Prerequisites ===&lt;br /&gt;
&lt;br /&gt;
The following prerequisites apply to Software Heritage GSoD projects:&lt;br /&gt;
&lt;br /&gt;
* [http://www.sphinx-doc.org/ Sphinx] is our documentation system of choice, you should be familiar with it to apply. In particular, we generally use reStructuredText markup and (for API references) Python docstrings with the [https://www.sphinx-doc.org/en/master/usage/extensions/napoleon.html Napoleon style]&lt;br /&gt;
* [https://git-scm.com Git] is our version control system of choice, you should be familiar with it to apply&lt;br /&gt;
* additional prerequisites depend on the project you will work on; check project descriptions for details&lt;br /&gt;
&lt;br /&gt;
=== Before you apply ===&lt;br /&gt;
&lt;br /&gt;
Here are the steps you should follow before applying, to make sure you have a general idea of the current state of Software Heritage technical documentation:&lt;br /&gt;
&lt;br /&gt;
* Learn about our project via our [https://www.softwareheritage.org/ main website] and the actual [https://archive.softwareheritage.org/ source code archive]&lt;br /&gt;
* Check out the index of our [https://www.softwareheritage.org/community/developers/ resources for developers]&lt;br /&gt;
* In particular make sure to have a look at:&lt;br /&gt;
** our [https://wiki.softwareheritage.org/ public wiki]&lt;br /&gt;
** our [https://docs.softwareheritage.org/devel/ documentation index]&lt;br /&gt;
* Technical setup:&lt;br /&gt;
** Create an account on our [https://forge.softwareheritage.org development forge]&lt;br /&gt;
** Familiarize yourself with our [[Code review in Phabricator|code review workflow]]&lt;br /&gt;
** Make a simple change to the documentation of any one of our [https://docs.softwareheritage.org/devel/ software components] and submit it as a [https://forge.softwareheritage.org/differential/ diff] for code review, following the above workflow. Feel free to submit any patch you think it might be useful.&lt;br /&gt;
&lt;br /&gt;
=== What to include in your application ===&lt;br /&gt;
&lt;br /&gt;
Make sure that your application includes the following information:&lt;br /&gt;
&lt;br /&gt;
* Describe the '''specific project''' you want to work on. What do you want to achieve? Why is it important? Why is it useful for Software Heritage? The project might be one of the project ideas that we have prepared below, or something else entirely that you want to contribute to Software Heritage. Your pet peeve, surprise us!&lt;br /&gt;
* Detail your '''work plan''': a brief description of how you plan to go about your project, including a list of  ''deliverables'' and a ''timeline'' of when do you expect them to be available.&lt;br /&gt;
* Include a reference to '''the diff''' you submitted before applying (see the &amp;quot;Before you apply&amp;quot; section above).&lt;br /&gt;
&lt;br /&gt;
== Ideas list ==&lt;br /&gt;
&lt;br /&gt;
Below you can find a list of project ideas that are good options for a&lt;br /&gt;
reasonably sized GSoD project.  They are just suggestion though, don't feel&lt;br /&gt;
obliged to pick one of them if there is nothing that fits your taste and&lt;br /&gt;
abilities.  Feel free to propose something else that you are excited about and&lt;br /&gt;
that contributes to improve the Software Heritage documentation: we will be&lt;br /&gt;
happy to consider it!&lt;br /&gt;
&lt;br /&gt;
=== Reorganize developer documentation as tutorials/how-to/discussions/references ===&lt;br /&gt;
&lt;br /&gt;
Our [https://docs.softwareheritage.org/devel/ developer documentation] is not particularly structured, making it hard to use it properly.&lt;br /&gt;
We would like to reorganize it following the [https://www.youtube.com/watch?v=t4vKPhjcMZg tutorials/how-to/discussions/references taxonomy], or something equally sensible.&lt;br /&gt;
&lt;br /&gt;
This project will require discussing the intended use cases of this page, conceptually structure the future version of it, implement it, and (re)write suitable language to introduce the various parts.&lt;br /&gt;
&lt;br /&gt;
=== Write a high-level technical overview of the project, data model, and data flow ===&lt;br /&gt;
&lt;br /&gt;
We have described the data model and archival data flow of Software Heritage in [https://upsilon.cc/~zack/research/publications/ipres-2018-doi.pdf various] [https://upsilon.cc/~zack/research/publications/cacm-2018-software-heritage.pdf scientific] [https://upsilon.cc/~zack/research/publications/msr-2019-swh.pdf papers], but haven't really worked on a general, high-level, technical presentation of it that targets developers.&lt;br /&gt;
The current [https://docs.softwareheritage.org/devel/swh-model/data-model.html#data-model data model documentation] has been copy-pasted from scientific papers.&lt;br /&gt;
We want to revisit it, to make sure the language description is suitable for a more general technical public, possibly complementing it with more abstract data-structure descriptions and cross-references to the code implementing the model and the workflow in the actual implementation.&lt;br /&gt;
&lt;br /&gt;
=== Design and document consistent writing conventions for Python docstrings ===&lt;br /&gt;
&lt;br /&gt;
The docstrings of the [https://docs.softwareheritage.org/devel/#components various software components] in the Software Heritage stack are not very consistent.&lt;br /&gt;
They have been written by a number of different developers, with varying degrees of English proficiency.&lt;br /&gt;
&lt;br /&gt;
We would like to document, as a set of writing guidelines, how to write them to the benefits of current and future developers.&lt;br /&gt;
The guidelines should include what to write in docstrings and whatnot, writing styles and guidance, examples of good and bad content for them, etc.&lt;br /&gt;
&lt;br /&gt;
To the extent it is possible, this project will also explore automating checks for guidelines conformance, in the form of basic structural (e.g., are all function parameters documented? is the function name correct? etc.) and writing checks (e.g., spellchecking, grammar checking, etc.).&lt;br /&gt;
&lt;br /&gt;
=== Revamp new (code) contributor tutorial ===&lt;br /&gt;
&lt;br /&gt;
The current [https://docs.softwareheritage.org/devel/#getting-started getting started documentation] include two overlapping documents: one about how to ''run'' a local instance of the full Software Heritage stack, another about how to create a local setup for current and future project ''developers''.&lt;br /&gt;
We would like to review those documents for consistency and, more generally, revamp documentation that will help wannabe code contributors to quickly get started and submit patches.&lt;br /&gt;
&lt;br /&gt;
As part of this project we might also want to revamp glossary, data model and other cross-cutting documents that help newcomers understand where-is-what in the code base and how to practically prepare and send us patches.&lt;br /&gt;
&lt;br /&gt;
=== Restructure wiki landing page as main community entry point ===&lt;br /&gt;
&lt;br /&gt;
Our [[Main_Page|public wiki landing page]] is, well, a mess.&lt;br /&gt;
It has grown inorganically by accumulating links to pages used more as working drafts than reasonably structured documents.&lt;br /&gt;
We would like to restructure it to cater for various use cases (users, developers, students, curators, scientists, etc.) in the attempt of making it the primary entry point into Software Heritage for all community members.&lt;br /&gt;
&lt;br /&gt;
This project will require discussing the various uses cases, design a suitable page layout, implement it in the wiki, write the introductory parts for each path into the wiki.&lt;br /&gt;
A related task, if time permits, will be to reorganize the wiki [[Special:Categories|category taxonomy]] and document when/how to use the available categories in the future.&lt;br /&gt;
&lt;br /&gt;
Working on this project will require familiarity with [https://www.mediawiki.org/wiki/MediaWiki Mediawiki] and its markup language.&lt;br /&gt;
&lt;br /&gt;
== Contact ==&lt;br /&gt;
&lt;br /&gt;
GSoD applicants are encouraged to get in touch with the Software Heritage community using the standard development communication channels, i.e.:&lt;br /&gt;
&lt;br /&gt;
* the #swh-devel IRC channel on [https://freenode.net Freenode]&lt;br /&gt;
* the [https://sympa.inria.fr/sympa/info/swh-devel swh-devel mailing list]&lt;br /&gt;
&lt;br /&gt;
See our [https://www.softwareheritage.org/community/developers/ development information page] for more details.&lt;br /&gt;
&lt;br /&gt;
== Timeline ==&lt;br /&gt;
&lt;br /&gt;
See the official [https://developers.google.com/season-of-docs/docs/timeline Google Season of Docs timeline].&lt;/div&gt;</summary>
		<author><name>Archit agrawal</name></author>
	</entry>
</feed>