Skip to content

edsu/ginger

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ginger

Ginger is a set of programs for harvesting large sets of URLs in a distributed fashion using Amazon Web Services. Think of ginger as Web harvesting rebooted for the cloud, so that you can reasonably rent the machines when you need them, and can retire them when you don't. If you are wondering why "ginger" let's just say it's because @eikeon likes golang, AWS, the Web ... and ginger. We think you will too.

Run

Components

  • gw - a Web application and REST API
  • ??? - query for and queue work
  • ??? - checks the web for the resource
  • ??? - persists resource metadata
  • ??? - checks web archives for resource
  • ??? - example importer for checking external links in Wikipedia

Develop Build Status

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published