From Software Heritage Wiki
Revision as of 15:07, 25 April 2017 by Morane (talk | contribs)
Schema.org is an initiative sponsored by Google, Microsoft, Yahoo and Yandex that promotes a very pragmatic approach to the semantic web/linked data. The schema.org vocabulary is recognized by the search engines listed above.

At the time of writing, over 10 million sites use schema.org to mark up content, and many applications of the involved companies (e.g., GMail, Google Calendar) exploit schema.org metadata to provide added-value features (e.g., automatically interpreting reservation details to add events to your calendar).


The schema.org ontology has, at the time of writing, relatively little support for software-related entities. The most notable classes are Code, SoftwareApplication and SoftwareSourceCode.

The hierarchical organization of schema.org shows that all three are subclasses of CreativeWork, which is itself a subclass of Thing. SoftwareApplication and SoftwareSourceCode are subclasses of Code; MobileApplication and WebApplication are subclasses of SoftwareApplication.


Properties common to Code, SoftwareApplication and SoftwareSourceCode

'about', 'accessibilityAPI', 'accessibilityControl', 'accessibilityFeature', 'accessibilityHazard', 'accountablePerson', 'additionalType', 'aggregateRating', 'alternateName', 'alternativeHeadline', 'associatedMedia', 'audience', 'audio', 'author', 'award', 'character', 'citation', 'comment', 'commentCount', 'contentLocation', 'contentRating', 'contributor', 'copyrightHolder', 'copyrightYear', 'creator', 'dateCreated', 'dateModified', 'datePublished', 'description', 'disambiguatingDescription', 'discussionUrl', 'editor', 'educationalAlignment', 'educationalUse', 'encoding', 'exampleOfWork', 'fileFormat', 'funder', 'genre', 'hasPart', 'headline', 'image', 'inLanguage', 'interactionStatistic', 'interactivityType', 'isAccessibleForFree', 'isBasedOn', 'isFamilyFriendly', 'isPartOf', 'keywords', 'learningResourceType', 'license', 'locationCreated', 'mainEntity', 'mainEntityOfPage', 'mentions', 'name', 'offers', 'position', 'potentialAction', 'producer', 'provider', 'publication', 'publisher', 'publishingPrinciples', 'recordedAt', 'releasedEvent', 'review', 'sameAs', 'schemaVersion', 'sourceOrganization', 'spatialCoverage', 'sponsor', 'temporalCoverage', 'text', 'thumbnailUrl', 'timeRequired', 'translator', 'typicalAgeRange', 'url', 'version', 'video', 'workExample'

We noted that the properties of Code are a subset of the properties of SoftwareApplication and of SoftwareSourceCode.

Properties common to SoftwareSourceCode and SoftwareApplication


Properties only in SoftwareApplication

'installUrl', 'memoryRequirements', 'processorRequirements', 'countriesSupported', 'featureList', 'availableOnDevice', 'applicationSubCategory', 'interactionService', 'applicationSuite', 'softwareHelp', 'countriesNotSupported', 'actionApplication', 'applicationCategory', 'softwareRequirements', 'screenshot', 'releaseNotes', 'softwareAddOn', 'storageRequirements', 'fileSize', 'permissions', 'downloadUrl', 'softwareVersion', 'supportingData', 'operatingSystem'

Properties only in SoftwareSourceCode

'codeRepository', 'runtimePlatform', 'codeSampleType', 'programmingLanguage'
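
The grouping used in the sections above (properties common to several classes, properties found in only one class) can be reproduced with plain set operations. The following is a minimal sketch, not the method actually used to build these lists: the sets are abbreviated samples taken from the lists on this page, and the variable names are hypothetical.

```python
# Abbreviated samples of the property lists on this page (NOT the full sets).
# shared_sample stands in for the properties common to all three classes.
shared_sample = {"author", "license", "name", "url", "version"}

# Per the note above, Code only carries the shared properties.
code = set(shared_sample)

software_application = shared_sample | {
    "downloadUrl", "operatingSystem", "softwareVersion",
}
software_source_code = shared_sample | {
    "codeRepository", "programmingLanguage", "runtimePlatform",
}

# Properties present in both classes.
common = software_application & software_source_code
# Properties present in one class but not in the other.
only_application = software_application - software_source_code
only_source_code = software_source_code - software_application

# Subset check corresponding to the remark about Code.
code_is_subset = code <= software_application and code <= software_source_code

print("common:", sorted(common))
print("only SoftwareApplication:", sorted(only_application))
print("only SoftwareSourceCode:", sorted(only_source_code))
print("Code is a subset of both:", code_is_subset)
```

With the full property lists loaded into the sets, the same three expressions would yield the contents of the corresponding sections.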

Websites and formats using schema.org

Usage of schema.org classes observed when testing web pages that present software projects, applications or code with Google's Structured Data Testing Tool:

  • GitHub
  • The CodeMeta initiative uses schema.org classes for Person and for SoftwareSourceCode, in JSON-LD format

Linked Data and the schema.org vocabulary can be used on a web page to link data with an appropriate syntax (Microdata, RDFa or JSON-LD). A short getting started article is available on schema.org.
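
As an illustration of the JSON-LD syntax, here is a minimal sketch that describes a software project with the SoftwareSourceCode class and wraps the result in the `<script type="application/ld+json">` element used to embed JSON-LD in an HTML page. All values (project name, repository URL, author) are hypothetical placeholders, and the helper `to_jsonld_script` is an assumption made for this example, not part of any existing tool.

```python
import json

# All values below are hypothetical placeholders, not real project data.
metadata = {
    "@context": "https://schema.org",
    "@type": "SoftwareSourceCode",
    "name": "example-project",
    "codeRepository": "https://example.org/example-project.git",
    "programmingLanguage": "Python",
    "license": "https://spdx.org/licenses/GPL-3.0-or-later",
    "author": {"@type": "Person", "name": "Jane Doe"},
}


def to_jsonld_script(data: dict) -> str:
    """Serialize a metadata dict and wrap it in a JSON-LD script element."""
    body = json.dumps(data, indent=2)
    return f'<script type="application/ld+json">\n{body}\n</script>'


snippet = to_jsonld_script(metadata)
print(snippet)
```

The printed element could be placed in the page's HTML, where tools that read structured data can pick it up. The same information could also be expressed with Microdata or RDFa attributes on the visible HTML elements.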

Using schema.org markup enables rich search results on various search engines such as Google or Yahoo, listing in Google's Knowledge Graph, and other possibilities offered by structured data.[1]

For the Software Heritage user interface, a markup syntax and vocabulary are yet to be decided.