To use

  1. Create a virtual environment for the project using Python 3.8+ (see the command sketch after this list).
  2. Install the requirements with pip install -r requirements.txt.
  3. Update the search URLs in ./imot_bg_crawler/input.yaml.
    When done, validate the input file with http://www.yamllint.com/.
  4. Run the spider for the desired website (see the Spiders section below). If you do not want logs, append --nolog to the end of the command.
  5. When finished, check the ./imot_bg_crawler/output_files folder for the results.
  6. Enjoy.
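
A minimal end-to-end sketch of the steps above, assuming a Unix-like shell and that requirements.txt sits in the repository root. The yamllint package used for local validation is an optional extra, not part of the project; the web checker linked above works just as well:

    # 1. Create and activate a virtual environment (Python 3.8+)
    python3 -m venv .venv
    source .venv/bin/activate

    # 2. Install the requirements
    pip install -r requirements.txt

    # 3. Edit the search URLs, then optionally validate the YAML locally
    pip install yamllint
    yamllint ./imot_bg_crawler/input.yaml

    # 4. Run a spider (see the Spiders section); --nolog suppresses log output
    scrapy crawl imot.bg --nolog

    # 5. Inspect the results
    ls ./imot_bg_crawler/output_files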

Spiders

  1. Imot.bg – scrapy crawl imot.bg
  2. Imoti.com – scrapy crawl imoti.com
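
Both commands are run from the directory containing scrapy.cfg (assumed here to be the repository root), and both accept the usual Scrapy options such as --nolog:

    scrapy crawl imot.bg             # crawl Imot.bg with full logging
    scrapy crawl imoti.com --nolog   # crawl Imoti.com without log output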

Settings

SKIP_EXISTING – does not save an item's data if it has already been saved; default: True.

PER_ITEM_RESULT – saves each item in a separate folder; default: True.

PER_ITEM_DOWNLOAD_IMAGES – when PER_ITEM_RESULT is enabled, controls whether the crawler downloads each item's images; default: True.
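
These settings can also be overridden for a single run with Scrapy's standard -s flag. This sketch assumes the project reads them through Scrapy's normal settings machinery (e.g. via getbool for the boolean values), which determines how the string "False" is interpreted:

    # One-off overrides without editing the settings file
    scrapy crawl imot.bg -s SKIP_EXISTING=False
    scrapy crawl imoti.com -s PER_ITEM_RESULT=True -s PER_ITEM_DOWNLOAD_IMAGES=False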
