Update README.md
This commit is contained in:
parent
4596e0f6a4
commit
438ae651c2
20
README.md
20
README.md
@ -1,2 +1,22 @@
|
|||||||
# Scrape_eScribe
|
# Scrape_eScribe
|
||||||
|
|
||||||
|
This bash script will scrape meetings from the eScribe meetings platform. `websites.csv` holds an index of domains to crawl. The format is as follows:
|
||||||
|
```
|
||||||
|
"<eScribe domain>","<output directory>","<leave empty, this entry is used by other tools>"
|
||||||
|
```
|
||||||
|
As an example, an entry might look like this:
|
||||||
|
```
|
||||||
|
"https://pub-london.escribemeetings.com/", "LondonArchive", ""
|
||||||
|
```
|
||||||
|
Files will be output to `./LondonArchive/Meetings/`.
|
||||||
|
|
||||||
|
The basic structure of the output files is:
|
||||||
|
```
|
||||||
|
./LondonArchive/Meetings/<board/committee name>/<year>/<mm-dd>/
|
||||||
|
|- <agenda>.pdf
|
||||||
|
|- <minutes>.pdf
|
||||||
|
\- Attachments/
|
||||||
|
|- <attachment 1>.pdf
|
||||||
|
|- <attachment 2>.pdf
|
||||||
|
\- etc etc
|
||||||
|
```
|
||||||
Loading…
Reference in New Issue
Block a user