| 5cc9297a01 | chore: update filter middleware | 2022-06-27 00:06:11 +01:00 |
| a07bb6f5bd | chore: add filter middleware | 2022-06-27 00:06:07 +01:00 |
| 55e4ff7722 | chore: add remove citations to item loader | 2022-06-27 00:06:03 +01:00 |
| 6fcae89c5d | chore: update itemloader and exclusion filter for flags | 2022-06-27 00:05:59 +01:00 |
| ceb7aa5b08 | chore: update flag spider & add item loader for processing | 2022-06-27 00:05:53 +01:00 |
| 9ef6f94516 | chore: fix index error in flags spider | 2022-06-26 17:50:21 +01:00 |
| 44a8365c53 | chore: try alternative flag spider | 2022-06-26 17:46:04 +01:00 |
| b5eec4550d | chore: update flags spider | 2022-06-26 17:27:37 +01:00 |
| 42df945c25 | chore: add bash script to run flags spider | 2022-06-26 17:19:23 +01:00 |
| eda5e67058 | chore: add flags spider | 2022-06-26 17:18:31 +01:00 |
| 7e95992654 | chore: add missing import | 2022-06-26 16:34:44 +01:00 |
| 1a2851f46c | chore: remove extra import | 2022-06-26 16:34:09 +01:00 |
| 7ac4817c09 | chore: add bash script to run capitals spider | 2022-06-26 16:34:09 +01:00 |
| 0d2b379a28 | chore: add capitals spider | 2022-06-26 16:34:09 +01:00 |
| 101f4a4080 | chore: update playground | 2022-06-25 23:22:59 +01:00 |
| d51e803baf | chore: dont set file url if anthem is missing | 2022-06-25 17:17:00 +01:00 |
| c4f785286c | chore: change spider name for anthems spider | 2022-06-25 17:03:57 +01:00 |
| a6688a5699 | chore: add missing comment | 2022-06-25 17:03:09 +01:00 |
| 1156976823 | chore: add anthems spider | 2022-06-25 17:01:53 +01:00 |
| e865018fd9 | chore: change xml for flag description url | 2022-06-24 22:59:13 +01:00 |
| fa26c99ba5 | chore: add flag image url to saved item | 2022-06-24 22:43:15 +01:00 |
| 17b0462da5 | chore: fix flag file url | 2022-06-24 22:20:34 +01:00 |
| cb82bd226f | chore: add https to flag file url | 2022-06-24 22:00:14 +01:00 |
| 59df2f02dd | chore: comment out backup spider | 2022-06-24 21:52:54 +01:00 |
| f9d364506a | chore: decrease download delay | 2022-06-24 21:52:14 +01:00 |
| cbf4129db4 | chore: fix flag description url | 2022-06-24 21:50:29 +01:00 |
| 3108ab1c1f | chore: change indent | 2022-06-24 21:00:00 +01:00 |
| 34d6980cac | chore: remove anthem from scraper | 2022-06-24 20:48:56 +01:00 |
| 0bd759a002 | chore: try new xpath for anthem | 2022-06-24 02:14:45 +01:00 |
| 4b305f757c | chore: fix anthem url | 2022-06-24 01:17:06 +01:00 |
| f0b675d4ef | chore: fix anthem file | 2022-06-24 00:51:15 +01:00 |
| 2c3f6cb3d0 | chore: add missing yield | 2022-06-24 00:24:19 +01:00 |
| 7e1b87ff06 | chore: debug anthem | 2022-06-24 00:08:38 +01:00 |
| 4b36736990 | chore: add jupyterlab | 2022-06-23 23:40:47 +01:00 |
| 99c45ff668 | chore: remove indent | 2022-06-23 23:40:33 +01:00 |
| 4a47fd2d35 | chore: fix incorrect key | 2022-06-23 22:51:49 +01:00 |
| 71b8cebc42 | chore: try get anthem from page | 2022-06-23 22:51:49 +01:00 |
| a59d34d180 | chore: disable dev filter for downloads | 2022-06-23 02:30:46 +01:00 |
| 96c99fdb1f | chore: readd country html to download | 2022-06-23 02:30:22 +01:00 |
| 1ab07ff396 | chore: add .oga to anthem files to download | 2022-06-23 02:25:06 +01:00 |
| 3ea83c2025 | chore: add black to playground script | 2022-06-23 02:24:45 +01:00 |
| 5581762c39 | chore: change anthem download to .ogg instead of .mp3 | 2022-06-22 23:19:49 +01:00 |
| c781e337b8 | chore: add link to exporting feeds to dev docs | 2022-06-22 21:47:51 +01:00 |
| f8fa357de4 | chore: save flags/anthems to own directories | 2022-06-22 21:47:51 +01:00 |
| 3cb4b4ba46 | chore: change anthem to store html | 2022-06-22 21:47:51 +01:00 |
| 97be860627 | chore: add json feeds output | 2022-06-22 21:47:51 +01:00 |
| 522f766b49 | chore: add download delay to settings | 2022-06-22 21:47:51 +01:00 |
| 721114bf1b | chore: add ./data to .gitignore | 2022-06-22 21:47:51 +01:00 |
| 9d91bb5898 | chore: add dev playground | 2022-06-22 20:39:49 +01:00 |
| e49fa7a346 | chore: add initial scrapy code | 2022-06-22 20:39:41 +01:00 |