Go to file
2026-04-02 12:56:16 -04:00
LICENSE Initial commit 2026-04-02 12:47:05 -04:00
README.md Update README.md 2026-04-02 12:56:16 -04:00
SCRAPE_LONDON.SH Add SCRAPE_LONDON.SH 2026-04-02 12:48:03 -04:00
websites.csv Example CSV 2026-04-02 12:50:03 -04:00

Scrape_eScribe

This bash script will scrape meetings from the eScribe meetings platform. websites.csv holds an index of domains to crawl. The format is as follows:

"<eScribe domain>","<output directory>","<leave empty, this entry is used by other tools>"

As an example, an entry might look like this:

"https://pub-london.escribemeetings.com/", "LondonArchive", ""

Files will be output to ./LondonArchive/Meetings/.

The basic structure of the output files is:

./LondonArchive/Meetings/<board/committee name>/<year>/<mm-dd>/
                                                              |- <agenda>.pdf
                                                              |- <minutes>.pdf
                                                              \- Attachments/
                                                                            |- <attachment 1>.pdf
                                                                            |- <attachment 2>.pdf
                                                                            \- etc etc