1
2
mirror of https://github.com/vimagick/dockerfiles synced 2024-06-25 16:28:40 +00:00
This commit is contained in:
kev 2015-05-28 09:29:27 +08:00
parent 43c755047a
commit 686b92a06d
2 changed files with 17 additions and 3 deletions

@ -16,6 +16,7 @@ RUN apt-get update \
python-dev \
&& curl -sSL https://bootstrap.pypa.io/get-pip.py | python \
&& pip install scrapy==1.0.0rc1 \
&& curl -sSL https://github.com/scrapy/scrapy/raw/master/extras/scrapy_bash_completion -o /etc/bash_completion.d/scrapy_bash_completion \
&& apt-get remove -y build-essential \
libffi-dev \
libssl-dev \

@ -1,6 +1,19 @@
`Scrapy`: An open source and collaborative framework
for extracting the data you need from websites.
In a fast, simple, yet extensible way.
## WHAT-IS
`Scrapy`: An open source and collaborative framework for extracting the data
you need from websites. In a fast, simple, yet extensible way.
You can create a scrapy (v1.0.0rc1) project on top of this image.
## HOW-TO
```
docker run --name scrapy -it vimagick/scrapy
```
## TODO-LIST
- build [libxml2][1]/[libxslt][2] from source
[1]: http://www.xmlsoft.org/downloads.html
[2]: http://git.gnome.org/browse/libxslt/