Skip to content

sim4life/fruit_scraping

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Fruit HTML scraping

This repo provides a basic script to scrape html doc and build JSON out of useful info present within.

###Instructions to run:

  1. Download and install Golang sdk, if not already, from https://golang.org/dl/
  2. Setup $GOPATH environment variable to some directory
  3. On your terminal create the relevant directories and navigate to it by running the following command

$ mkdir -p $GOPATH/src/github.com/sim4life && cd $_

  1. Now clone the repo in that directory by

$ git clone git@github.com:sim4life/fruit_scraping.git

  1. Then download dependencies by

$ go get golang.org/x/net/publicsuffix
$ go get github.com/PuerkitoBio/goquery
$ go get github.com/djimenez/iconv-go

  1. To run

$ go run fruitsJSON.go

  1. To test

$ go test

I hope you enjoy the show.

About

This repo provides a basic script to scrape html doc and build json out of useful info

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages