mirror of https://github.com/vimagick/dockerfiles synced 2024-06-25 16:28:40 +00:00
kev 2015-05-28 12:21:17 +08:00
parent 918b9dae79
commit 51c715a63c
2 changed files with 11 additions and 3 deletions

@@ -14,6 +14,7 @@ RUN apt-get update \
libtool \
python \
python-dev \
vim-tiny \
&& mkdir libxml2 \
&& curl -sSL ftp://xmlsoft.org/libxml2/libxml2-2.9.2.tar.gz | tar xz --strip 1 -C libxml2 \
&& cd libxml2 \
@@ -33,8 +34,9 @@ RUN apt-get update \
&& cd .. \
&& rm -rf libxslt \
&& curl -sSL https://bootstrap.pypa.io/get-pip.py | python \
&& pip install scrapy==1.0.0rc1 \
&& pip install scrapy==0.24.6 \
&& curl -sSL https://github.com/scrapy/scrapy/raw/master/extras/scrapy_bash_completion -o /etc/bash_completion.d/scrapy_bash_completion \
&& echo 'source /etc/bash_completion.d/scrapy_bash_completion' >> /root/.bashrc \
&& apt-get remove -y autoconf \
build-essential \
libffi-dev \

@@ -3,12 +3,18 @@
`Scrapy`: An open source and collaborative framework for extracting the data
you need from websites. In a fast, simple, yet extensible way.
You can create a scrapy (v1.0.0rc1) project on top of this image.
This image is based on `debian:jessie`, and it only takes 278.6 MB.
You can create a scrapy (v0.24.6) project on top of this image.
## HOW-TO
```
docker run --name scrapy -it vimagick/scrapy
$ docker run --name scrapy -it vimagick/scrapy
>>> scrapy startproject demo
>>> cd demo
>>> scrapy genspider example example.com
>>> scrapy edit example
>>> scrapy crawl example
```
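The session above is interactive. As a rough sketch, the same steps could be chained into a single one-off `docker run` call. The image name and the `scrapy` commands come from the HOW-TO. The `sh -c` wrapper, the `--rm` flag, and the helper names `build_steps` and `build_docker_argv` are assumptions made for illustration. The sketch also assumes the image defines no `ENTRYPOINT` that would intercept the command. `scrapy edit` is left out because it opens an editor.

```python
import shlex

# Image name taken from the HOW-TO; everything else here is illustrative.
IMAGE = "vimagick/scrapy"


def build_steps(project="demo", spider="example", domain="example.com"):
    """Return the HOW-TO steps joined into one shell command line.

    `scrapy edit` is skipped on purpose: it needs an interactive editor.
    """
    steps = [
        ["scrapy", "startproject", project],
        ["cd", project],
        ["scrapy", "genspider", spider, domain],
        ["scrapy", "crawl", spider],
    ]
    # Quote every argument so names containing spaces survive `sh -c`.
    return " && ".join(
        " ".join(shlex.quote(arg) for arg in step) for step in steps
    )


def build_docker_argv(script, image=IMAGE):
    """Wrap a shell command line in a throwaway `docker run` invocation.

    Assumption: the image has no ENTRYPOINT, so `sh -c` runs directly.
    """
    return ["docker", "run", "--rm", image, "sh", "-c", script]


script = build_steps()
argv = build_docker_argv(script)
print(script)
```

Passing `argv` to `subprocess.check_call` would run the whole sequence in a fresh container. Because of `--rm`, the generated project is discarded when the container exits unless a volume is mounted.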
## TODO-LIST