Agroforestry Species Switchboard 2.0 Scraper

Scrapes scientific name information for plants from the Agroforestry Species Switchboard 2.0.

Requirements

Python 3 and pipenv, both of which are used by the commands under How to run.

How to run

  1. Install dependencies

    cp env.sample .env
    pipenv --python 3
    pipenv install

  2. Run

    pipenv run python main.py

  3. The result will be placed in a file named result.*.csv
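
To inspect the output after a run, something like the following could be used. It is only a sketch: the helper names latest_result and read_rows are hypothetical, UTF-8 encoding is an assumption, and no column layout is assumed because the README does not document one.

```python
import csv
from pathlib import Path
from typing import Optional


def latest_result(directory: str = ".") -> Optional[Path]:
    """Return the most recently modified result.*.csv in directory, or None."""
    candidates = sorted(Path(directory).glob("result.*.csv"),
                        key=lambda p: p.stat().st_mtime)
    return candidates[-1] if candidates else None


def read_rows(path: Path) -> list:
    """Read every row as a list of strings; no column layout is assumed."""
    # UTF-8 is an assumption; adjust if the scraper writes another encoding.
    with path.open(newline="", encoding="utf-8") as handle:
        return list(csv.reader(handle))


if __name__ == "__main__":
    path = latest_result()
    if path is None:
        print("No result.*.csv file found; run the scraper first.")
    else:
        rows = read_rows(path)
        print(f"{path.name}: {len(rows)} rows")
```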

Test Shell

pipenv run scrapy shell 'http://apps.worldagroforestry.org/products/switchboard/index.php/species_search/Acacia%20abyssinica'
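
The URL in the command above is the search endpoint followed by the percent-encoded scientific name. A minimal sketch of building such a URL for any other species is shown here; species_search_url is a hypothetical helper and is not taken from the project's code.

```python
from urllib.parse import quote

# Base URL copied from the scrapy shell command above.
BASE_URL = ("http://apps.worldagroforestry.org/products/switchboard/"
            "index.php/species_search/")


def species_search_url(scientific_name: str) -> str:
    """Return the search URL for a scientific name (hypothetical helper)."""
    # quote() percent-encodes the space between genus and epithet as %20.
    return BASE_URL + quote(scientific_name.strip())


print(species_search_url("Acacia abyssinica"))
```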

Cleanup All Outputs

rm -f result.* log.*
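
For a cleanup that does nothing when no output files exist, a Python equivalent could look like the sketch below. The function name cleanup_outputs is hypothetical; the patterns result.* and log.* are the ones used in the command above.

```python
from pathlib import Path


def cleanup_outputs(directory: str = ".") -> list:
    """Delete result.* and log.* files in directory; return the removed names."""
    removed = []
    for pattern in ("result.*", "log.*"):
        for path in Path(directory).glob(pattern):
            # Skip directories that happen to match the pattern.
            if path.is_file():
                path.unlink()
                removed.append(path.name)
    return sorted(removed)


if __name__ == "__main__":
    for name in cleanup_outputs():
        print(f"removed {name}")
```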

Special Cases

| Case | Link | Note |
| --- | --- | --- |
| ICRAF Databases Not Found | Engelhardia spicata | |
| Genus Found | Forficula | What to do next? |
| Multiple Species Found | Alstonia spectabilis | Take the matched species, right? |
| Species Variant Found | Engelhardtia spicata | Needs a human to check |
| Similar Species Found | Costus speciosus | Needs a human to check |
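
One way to carry these cases through code is an enumeration plus a flag for the ones that call for a human check. This is a sketch only: the names SearchCase and needs_human_check are hypothetical, and how the scraper actually detects each case is not described here.

```python
from enum import Enum


class SearchCase(Enum):
    """Special cases listed above; each value is the case label."""
    NOT_FOUND = "ICRAF Databases Not Found"
    GENUS_FOUND = "Genus Found"
    MULTIPLE_SPECIES = "Multiple Species Found"
    SPECIES_VARIANT = "Species Variant Found"
    SIMILAR_SPECIES = "Similar Species Found"


# Cases whose note asks for a human to check the match.
NEEDS_HUMAN_CHECK = {SearchCase.SPECIES_VARIANT, SearchCase.SIMILAR_SPECIES}


def needs_human_check(case: SearchCase) -> bool:
    """Return True when the case should be reviewed by a person."""
    return case in NEEDS_HUMAN_CHECK
```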

Contributing

  1. Fork this repo
  2. Develop
  3. Create a pull request
  4. Tag @rizqirizqi for review
  5. Merge

License

GPL-3.0
