kwang2049 committed on
Commit
8d01c7e
1 Parent(s): 5ff9364
Files changed (2)
  1. README.md +72 -1
  2. local_run.py +3 -1881
README.md CHANGED
@@ -4,4 +4,75 @@ tags:
  - endpoints-template
  license: bsd-3-clause
  library_name: generic
- ---
+ ---
+
+ # Coreference Resolution for Long Documents
+ A modified version of the coreference resolution model from [BERT for Coreference Resolution: Baselines and Analysis](https://aclanthology.org/D19-1588/), adapted to handle long documents (~40K words) efficiently (500K words/s on an NVIDIA Tesla V100). This modified model was used in [DAPR: A Benchmark on Document-Aware Passage Retrieval](https://arxiv.org/abs/2305.13915).
+
+ ## Usage
+ ### API call
+ One can call Hugging Face's Inference Endpoints API directly:
+ ```python
+ import requests
+ import time
+
+ API_URL = "https://api-inference.huggingface.co/models/kwang2049/long-coref"
+ headers = {"Authorization": "Bearer ${YOUR_HUGGINGFACE_ACCESS_TOKEN}"}
+
+
+ def query(payload):
+     while True:
+         response = requests.post(API_URL, headers=headers, json=payload)
+         if response.status_code == 503:
+             # 503: service temporarily unavailable; wait, report the error, and retry
+             time.sleep(5)
+             print(response.json()["error"])
+             continue
+         elif response.status_code == 200:
+             return response.json()
+         else:
+             # response.json is a method, so it must be called before indexing
+             error_message = f"{response.status_code}: {response.json()['error']}."
+             raise requests.HTTPError(error_message)
+
+
+ doc = [
+     "The Half Moon is a public house and music venue in Putney, London. It is one of the city's longest running live music venues, and has hosted live music every night since 1963.",
+     "The pub is on the south side of the Lower Richmond road, in the London Borough of Wandsworth."
+ ]
+
+ PARAGRAPH_DELIMITER = "\n\n"
+
+ output = query(
+     {
+         "inputs": PARAGRAPH_DELIMITER.join(doc),
+     }
+ )
+ print(output)
+ # {
+ #     'pargraph_sentences': ...,
+ #     'top_spans': ...,
+ #     'antecedents': ...
+ # }
+ ```
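The `query` helper in the README example retries indefinitely while the endpoint answers with status 503. A minimal sketch of a bounded variant follows; the names `query_with_retries`, `post`, `max_retries`, `wait_seconds`, and `sleep` are illustrative assumptions and not part of the repo. The HTTP call is passed in as a function so the retry logic can be exercised without network access.

```python
import time
from typing import Any, Callable, Dict


def query_with_retries(
    post: Callable[[Dict[str, Any]], Any],
    payload: Dict[str, Any],
    max_retries: int = 5,
    wait_seconds: float = 5.0,
    sleep: Callable[[float], None] = time.sleep,
) -> Any:
    """Call `post(payload)`, retrying on 503 at most `max_retries` times.

    `post` is a hypothetical callable returning an object with a
    `status_code` attribute and a `json()` method (e.g. a requests.Response).
    """
    for attempt in range(max_retries + 1):
        response = post(payload)
        if response.status_code == 200:
            return response.json()
        if response.status_code != 503:
            # Any status other than 200 or 503 is treated as a hard failure.
            raise RuntimeError(f"Request failed with status {response.status_code}")
        if attempt < max_retries:
            # 503: wait before the next attempt.
            sleep(wait_seconds)
    raise TimeoutError(f"Still receiving 503 after {max_retries} retries")
```

With `requests`, `post` could be `lambda p: requests.post(API_URL, headers=headers, json=p)`, reusing `API_URL` and `headers` from the README example.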
+ ### Local run
+ One can also run the code of the repo on a local machine:
+
+ ```bash
+ # Clone the repo
+ git lfs install
+ git clone https://huggingface.co/kwang2049/long-coref
+ cd long-coref
+ pip install -r requirements.txt
+ python local_run.py
+ ```
+
+ ## Citation
+ If you use the repo, feel free to cite our publication [DAPR: A Benchmark on Document-Aware Passage Retrieval](https://arxiv.org/abs/2305.13915):
+ ```bibtex
+ @article{wang2023dapr,
+     title = "DAPR: A Benchmark on Document-Aware Passage Retrieval",
+     author = "Kexin Wang and Nils Reimers and Iryna Gurevych",
+     journal = "arXiv preprint arXiv:2305.13915",
+     year = "2023",
+     url = "https://arxiv.org/abs/2305.13915",
+ }
+ ```
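The sample output removed from `local_run.py` in this commit suggests that the `pargraph_sentences` field nests paragraphs, then sentences, then tokens, with each token carrying `text` and `text_with_ws`. Assuming that shape holds, a minimal sketch for rebuilding each paragraph's text from the tokens might look as follows; `paragraphs_to_text` is a hypothetical helper and the sample data is hand-written, not actual model output.

```python
from typing import Dict, List

Token = Dict[str, str]


def paragraphs_to_text(paragraph_sentences: List[List[List[Token]]]) -> List[str]:
    """Rebuild each paragraph by concatenating `text_with_ws` over its tokens."""
    return [
        "".join(token["text_with_ws"] for sentence in paragraph for token in sentence)
        for paragraph in paragraph_sentences
    ]


# Hand-written sample in the assumed shape: one paragraph with two sentences.
sample = [
    [
        [
            {"text": "The", "text_with_ws": "The "},
            {"text": "pub", "text_with_ws": "pub "},
            {"text": "is", "text_with_ws": "is "},
            {"text": "old", "text_with_ws": "old"},
            {"text": ".", "text_with_ws": ". "},
        ],
        [
            {"text": "It", "text_with_ws": "It "},
            {"text": "hosts", "text_with_ws": "hosts "},
            {"text": "music", "text_with_ws": "music"},
            {"text": ".", "text_with_ws": "."},
        ],
    ]
]

print(paragraphs_to_text(sample))
```

Because `text_with_ws` keeps each token's trailing whitespace, plain concatenation restores the original spacing without any detokenization rules.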
local_run.py CHANGED
@@ -6,1891 +6,13 @@ my_handler = PreTrainedPipeline(path=".")
6
  doc = [
7
  "The Half Moon is a public house and music venue in Putney, London. It is one of the city's longest running live music venues, and has hosted live music every night since 1963.",
8
  "The pub is on the south side of the Lower Richmond road, in the London Borough of Wandsworth.",
9
- "The Half Moon is one of London's longest running, and most respected live music venues. Since the early 1960s, some of the biggest names in popular music have performed there, including The Rolling Stones, and The Who. The venue has hosted live music every night since 1963.It all began with the folk and blues sessions started by Gerry Lockran, Royd Rivers and Cliff Aungier in 1963. 'Folksville', as the sessions were called, featured new British and European artists alongside established American blues-men. These included Sonny Terry and Brownie McGhee, Champion Jack Dupree and Arthur Crudup. British acts included Ralph McTell, John Martyn, Bert Jansch and Roy Harper.John Mayall's Bluesbreakers, Alexis Korner, and The Yardbirds made an appearance here and at other South London venues such as The Eel Pie Club and Crawdaddy Club.As blues and folk thrived, bringing Fairport Convention and Van Morrison, so other genres began to appear. From the psychedelia of The Bonzo Dog Doo Dah Band and Bob Kerr's Whoopee Band; to 1960s mod groups The Pretty Things and Nashville Teens; to early pub-rock acts such as Dr. Feelgood; the Half Moon was the centre of not only the emerging music scenes but continued to host the big names.It was never just a \"passing through\" venue. Residencies at the Half Moon have included Elvis Costello (who would play a couple of times a month in the mid-1970s for 50 pence and a plate of sandwiches) and Steve Marriott of Small Faces fame. John Martyn returned decades after his first Half Moon gigs to a week-long residency, and both Tim Rose and Roy Harper did the same. On 24 July 2007, Welsh-language folk guitarist Meic Stevens performed his first London gig in over 30 years at the Half Moon. Other memorable gigs at the Half Moon include k.d. lang's first UK appearance, Kate Bush's first ever public performance and a surprise appearance by Nick Cave. 
The Hamsters played their last shows at the Half Moon in 2012, signing off on a 25-year career with five shows over one weekend.The Half Moon has always been synonymous with The Rolling Stones, whose most recent visit was a private event held in May 2000. As well as performing as the band, individual members have appeared here in various side-projects, and have also used the venue for rehearsal space. In January 2010, the Half Moon almost closed due to failing sales, rising rates and the recession, but they received hundreds of signatures and a Facebook campaign of 6,500 people. Musicians such as The X Factor finalist Jamie Archer, Eddi Reader and Simon Fowler supported the petition as well. As part of its revival, the Half Moon started serving food.In 2012, the Half Moon was bought by Geronimo Inns, whose parent company is Young & Co.",
10
- "Artists who have performed or recorded at the venue since the mid-1960s include the Rolling Stones, The Who, The Small Faces, Kasabian, Sisteray, Chris Bell, Ralph McTell, GoodLuck, John Martyn, John Mayall's Bluesbreakers, Alexis Korner, The Yardbirds, Bob Kerr's Whoopee Band, Morrissey–Mullen – who had a residency there of several years' standing, Rocket 88, Fairport Convention, Bonzo Dog Doo-Dah Band, Roy Harper, Van Morrison,Man, Danny Thompson, Dr. Feelgood, Elvis Costello, Meic Stevens, Finley Quaye, I Am Kloot, Starlite Campbell Band, Beverley Craven, Bo Diddley, John Otway, Tim Rose, Amy McDonald, Catfish Keith, The MonaLisa Twins, as well as k.d. lang's first UK appearance, and Kate Bush's first public performance.It has also hosted comedy, including Billy Connolly, Andy Parsons, Harry Hill, Rufus Hound, Shappi Khorsandi, Norman Lovett, Bob Mills, Milton Jones, Al Murray, Stewart Lee, Richard Herring, Jack Whitehall, Alistair McGowan, Katherine Ryan, Cardinal Burns, Reginald D. Hunter, Stewart Francis, Bridget Christie, Josh Widdicombe, Sara Pascoe, Rob Beckett, Sean Hughes, Kevin Eldon, Henning Wehn, Hal Cruttenden, Holly Walsh, Danny Bhoy, Aisling Bea and James Acaster.",
11
- "Live at Half Moon Putney – The De Luxe Blues Band (1981)An Evening with Meic Stevens: Recorded Live in London – Meic Stevens (2007) (Sunbeam Records, SBRCD5033)Live at the Half Moon – Catfish Keith (2009) (Fish Tail Records)Remember, on stage at The Half Moon – Latin Quarter (2023) (Westpark Music)",
12
- "The pub is served by Transport for London buses 22, 265, 378, 485 which stop on the Lower Richmond road. Putney Bridge tube station (District line) is a 12 minute walk over Putney Bridge and Putney railway station (Southwestern Railway) is a 12 minute walk up Putney High Street.The Santander Cycles Putney Pier docking station is a three minute walk.",
13
- "Official website",
14
  ]
15
- doc_string = "\n\n".join(doc)
 
16
 
17
  # prepare sample payload
18
- payload = {"inputs": doc_string}
19
  prediction = my_handler(payload)
20
 
21
  # show results
22
  print(prediction)
23
-
24
- # {
25
- # "pargraph_sentences": [
26
- # [
27
- # [
28
- # {"text": "The", "text_with_ws": "The "},
29
- # {"text": "Half", "text_with_ws": "Half "},
30
- # {"text": "Moon", "text_with_ws": "Moon "},
31
- # {"text": "is", "text_with_ws": "is "},
32
- # {"text": "a", "text_with_ws": "a "},
33
- # {"text": "public", "text_with_ws": "public "},
34
- # {"text": "house", "text_with_ws": "house "},
35
- # {"text": "and", "text_with_ws": "and "},
36
- # {"text": "music", "text_with_ws": "music "},
37
- # {"text": "venue", "text_with_ws": "venue "},
38
- # {"text": "in", "text_with_ws": "in "},
39
- # {"text": "Putney", "text_with_ws": "Putney"},
40
- # {"text": ",", "text_with_ws": ", "},
41
- # {"text": "London", "text_with_ws": "London"},
42
- # {"text": ".", "text_with_ws": ". "},
43
- # ],
44
- # [
45
- # {"text": "It", "text_with_ws": "It "},
46
- # {"text": "is", "text_with_ws": "is "},
47
- # {"text": "one", "text_with_ws": "one "},
48
- # {"text": "of", "text_with_ws": "of "},
49
- # {"text": "the", "text_with_ws": "the "},
50
- # {"text": "city", "text_with_ws": "city"},
51
- # {"text": "'s", "text_with_ws": "'s "},
52
- # {"text": "longest", "text_with_ws": "longest "},
53
- # {"text": "running", "text_with_ws": "running "},
54
- # {"text": "live", "text_with_ws": "live "},
55
- # {"text": "music", "text_with_ws": "music "},
56
- # {"text": "venues", "text_with_ws": "venues"},
57
- # {"text": ",", "text_with_ws": ", "},
58
- # {"text": "and", "text_with_ws": "and "},
59
- # {"text": "has", "text_with_ws": "has "},
60
- # {"text": "hosted", "text_with_ws": "hosted "},
61
- # {"text": "live", "text_with_ws": "live "},
62
- # {"text": "music", "text_with_ws": "music "},
63
- # {"text": "every", "text_with_ws": "every "},
64
- # {"text": "night", "text_with_ws": "night "},
65
- # {"text": "since", "text_with_ws": "since "},
66
- # {"text": "1963", "text_with_ws": "1963"},
67
- # {"text": ".", "text_with_ws": "."},
68
- # ],
69
- # ],
70
- # [
71
- # [
72
- # {"text": "The", "text_with_ws": "The "},
73
- # {"text": "pub", "text_with_ws": "pub "},
74
- # {"text": "is", "text_with_ws": "is "},
75
- # {"text": "on", "text_with_ws": "on "},
76
- # {"text": "the", "text_with_ws": "the "},
77
- # {"text": "south", "text_with_ws": "south "},
78
- # {"text": "side", "text_with_ws": "side "},
79
- # {"text": "of", "text_with_ws": "of "},
80
- # {"text": "the", "text_with_ws": "the "},
81
- # {"text": "Lower", "text_with_ws": "Lower "},
82
- # {"text": "Richmond", "text_with_ws": "Richmond "},
83
- # {"text": "road", "text_with_ws": "road"},
84
- # {"text": ",", "text_with_ws": ", "},
85
- # {"text": "in", "text_with_ws": "in "},
86
- # {"text": "the", "text_with_ws": "the "},
87
- # {"text": "London", "text_with_ws": "London "},
88
- # {"text": "Borough", "text_with_ws": "Borough "},
89
- # {"text": "of", "text_with_ws": "of "},
90
- # {"text": "Wandsworth", "text_with_ws": "Wandsworth"},
91
- # {"text": ".", "text_with_ws": "."},
92
- # ]
93
- # ],
94
- # [
95
- # [
96
- # {"text": "The", "text_with_ws": "The "},
97
- # {"text": "Half", "text_with_ws": "Half "},
98
- # {"text": "Moon", "text_with_ws": "Moon "},
99
- # {"text": "is", "text_with_ws": "is "},
100
- # {"text": "one", "text_with_ws": "one "},
101
- # {"text": "of", "text_with_ws": "of "},
102
- # {"text": "London", "text_with_ws": "London"},
103
- # {"text": "'s", "text_with_ws": "'s "},
104
- # {"text": "longest", "text_with_ws": "longest "},
105
- # {"text": "running", "text_with_ws": "running"},
106
- # {"text": ",", "text_with_ws": ", "},
107
- # {"text": "and", "text_with_ws": "and "},
108
- # {"text": "most", "text_with_ws": "most "},
109
- # {"text": "respected", "text_with_ws": "respected "},
110
- # {"text": "live", "text_with_ws": "live "},
111
- # {"text": "music", "text_with_ws": "music "},
112
- # {"text": "venues", "text_with_ws": "venues"},
113
- # {"text": ".", "text_with_ws": ". "},
114
- # ],
115
- # [
116
- # {"text": "Since", "text_with_ws": "Since "},
117
- # {"text": "the", "text_with_ws": "the "},
118
- # {"text": "early", "text_with_ws": "early "},
119
- # {"text": "1960s", "text_with_ws": "1960s"},
120
- # {"text": ",", "text_with_ws": ", "},
121
- # {"text": "some", "text_with_ws": "some "},
122
- # {"text": "of", "text_with_ws": "of "},
123
- # {"text": "the", "text_with_ws": "the "},
124
- # {"text": "biggest", "text_with_ws": "biggest "},
125
- # {"text": "names", "text_with_ws": "names "},
126
- # {"text": "in", "text_with_ws": "in "},
127
- # {"text": "popular", "text_with_ws": "popular "},
128
- # {"text": "music", "text_with_ws": "music "},
129
- # {"text": "have", "text_with_ws": "have "},
130
- # {"text": "performed", "text_with_ws": "performed "},
131
- # {"text": "there", "text_with_ws": "there"},
132
- # {"text": ",", "text_with_ws": ", "},
133
- # {"text": "including", "text_with_ws": "including "},
134
- # {"text": "The", "text_with_ws": "The "},
135
- # {"text": "Rolling", "text_with_ws": "Rolling "},
136
- # {"text": "Stones", "text_with_ws": "Stones"},
137
- # {"text": ",", "text_with_ws": ", "},
138
- # {"text": "and", "text_with_ws": "and "},
139
- # {"text": "The", "text_with_ws": "The "},
140
- # {"text": "Who", "text_with_ws": "Who"},
141
- # {"text": ".", "text_with_ws": ". "},
142
- # ],
143
- # [
144
- # {"text": "The", "text_with_ws": "The "},
145
- # {"text": "venue", "text_with_ws": "venue "},
146
- # {"text": "has", "text_with_ws": "has "},
147
- # {"text": "hosted", "text_with_ws": "hosted "},
148
- # {"text": "live", "text_with_ws": "live "},
149
- # {"text": "music", "text_with_ws": "music "},
150
- # {"text": "every", "text_with_ws": "every "},
151
- # {"text": "night", "text_with_ws": "night "},
152
- # {"text": "since", "text_with_ws": "since "},
153
- # {"text": "1963.It", "text_with_ws": "1963.It "},
154
- # {"text": "all", "text_with_ws": "all "},
155
- # {"text": "began", "text_with_ws": "began "},
156
- # {"text": "with", "text_with_ws": "with "},
157
- # {"text": "the", "text_with_ws": "the "},
158
- # {"text": "folk", "text_with_ws": "folk "},
159
- # {"text": "and", "text_with_ws": "and "},
160
- # {"text": "blues", "text_with_ws": "blues "},
161
- # {"text": "sessions", "text_with_ws": "sessions "},
162
- # {"text": "started", "text_with_ws": "started "},
163
- # {"text": "by", "text_with_ws": "by "},
164
- # {"text": "Gerry", "text_with_ws": "Gerry "},
165
- # {"text": "Lockran", "text_with_ws": "Lockran"},
166
- # {"text": ",", "text_with_ws": ", "},
167
- # {"text": "Royd", "text_with_ws": "Royd "},
168
- # {"text": "Rivers", "text_with_ws": "Rivers "},
169
- # {"text": "and", "text_with_ws": "and "},
170
- # {"text": "Cliff", "text_with_ws": "Cliff "},
171
- # {"text": "Aungier", "text_with_ws": "Aungier "},
172
- # {"text": "in", "text_with_ws": "in "},
173
- # {"text": "1963", "text_with_ws": "1963"},
174
- # {"text": ".", "text_with_ws": ". "},
175
- # {"text": "'", "text_with_ws": "'"},
176
- # ],
177
- # [
178
- # {"text": "Folksville", "text_with_ws": "Folksville"},
179
- # {"text": "'", "text_with_ws": "'"},
180
- # {"text": ",", "text_with_ws": ", "},
181
- # {"text": "as", "text_with_ws": "as "},
182
- # {"text": "the", "text_with_ws": "the "},
183
- # {"text": "sessions", "text_with_ws": "sessions "},
184
- # {"text": "were", "text_with_ws": "were "},
185
- # {"text": "called", "text_with_ws": "called"},
186
- # {"text": ",", "text_with_ws": ", "},
187
- # {"text": "featured", "text_with_ws": "featured "},
188
- # {"text": "new", "text_with_ws": "new "},
189
- # {"text": "British", "text_with_ws": "British "},
190
- # {"text": "and", "text_with_ws": "and "},
191
- # {"text": "European", "text_with_ws": "European "},
192
- # {"text": "artists", "text_with_ws": "artists "},
193
- # {"text": "alongside", "text_with_ws": "alongside "},
194
- # {"text": "established", "text_with_ws": "established "},
195
- # {"text": "American", "text_with_ws": "American "},
196
- # {"text": "blues", "text_with_ws": "blues"},
197
- # {"text": "-", "text_with_ws": "-"},
198
- # {"text": "men", "text_with_ws": "men"},
199
- # {"text": ".", "text_with_ws": ". "},
200
- # ],
201
- # [
202
- # {"text": "These", "text_with_ws": "These "},
203
- # {"text": "included", "text_with_ws": "included "},
204
- # {"text": "Sonny", "text_with_ws": "Sonny "},
205
- # {"text": "Terry", "text_with_ws": "Terry "},
206
- # {"text": "and", "text_with_ws": "and "},
207
- # {"text": "Brownie", "text_with_ws": "Brownie "},
208
- # {"text": "McGhee", "text_with_ws": "McGhee"},
209
- # {"text": ",", "text_with_ws": ", "},
210
- # {"text": "Champion", "text_with_ws": "Champion "},
211
- # {"text": "Jack", "text_with_ws": "Jack "},
212
- # {"text": "Dupree", "text_with_ws": "Dupree "},
213
- # {"text": "and", "text_with_ws": "and "},
214
- # {"text": "Arthur", "text_with_ws": "Arthur "},
215
- # {"text": "Crudup", "text_with_ws": "Crudup"},
216
- # {"text": ".", "text_with_ws": ". "},
217
- # ],
218
- # [
219
- # {"text": "British", "text_with_ws": "British "},
220
- # {"text": "acts", "text_with_ws": "acts "},
221
- # {"text": "included", "text_with_ws": "included "},
222
- # {"text": "Ralph", "text_with_ws": "Ralph "},
223
- # {"text": "McTell", "text_with_ws": "McTell"},
224
- # {"text": ",", "text_with_ws": ", "},
225
- # {"text": "John", "text_with_ws": "John "},
226
- # {"text": "Martyn", "text_with_ws": "Martyn"},
227
- # {"text": ",", "text_with_ws": ", "},
228
- # {"text": "Bert", "text_with_ws": "Bert "},
229
- # {"text": "Jansch", "text_with_ws": "Jansch "},
230
- # {"text": "and", "text_with_ws": "and "},
231
- # {"text": "Roy", "text_with_ws": "Roy "},
232
- # {"text": "Harper", "text_with_ws": "Harper"},
233
- # {"text": ".", "text_with_ws": "."},
234
- # ],
235
- # [
236
- # {"text": "John", "text_with_ws": "John "},
237
- # {"text": "Mayall", "text_with_ws": "Mayall"},
238
- # {"text": "'s", "text_with_ws": "'s "},
239
- # {"text": "Bluesbreakers", "text_with_ws": "Bluesbreakers"},
240
- # {"text": ",", "text_with_ws": ", "},
241
- # {"text": "Alexis", "text_with_ws": "Alexis "},
242
- # {"text": "Korner", "text_with_ws": "Korner"},
243
- # {"text": ",", "text_with_ws": ", "},
244
- # {"text": "and", "text_with_ws": "and "},
245
- # {"text": "The", "text_with_ws": "The "},
246
- # {"text": "Yardbirds", "text_with_ws": "Yardbirds "},
247
- # {"text": "made", "text_with_ws": "made "},
248
- # {"text": "an", "text_with_ws": "an "},
249
- # {"text": "appearance", "text_with_ws": "appearance "},
250
- # {"text": "here", "text_with_ws": "here "},
251
- # {"text": "and", "text_with_ws": "and "},
252
- # {"text": "at", "text_with_ws": "at "},
253
- # {"text": "other", "text_with_ws": "other "},
254
- # {"text": "South", "text_with_ws": "South "},
255
- # {"text": "London", "text_with_ws": "London "},
256
- # {"text": "venues", "text_with_ws": "venues "},
257
- # {"text": "such", "text_with_ws": "such "},
258
- # {"text": "as", "text_with_ws": "as "},
259
- # {"text": "The", "text_with_ws": "The "},
260
- # {"text": "Eel", "text_with_ws": "Eel "},
261
- # {"text": "Pie", "text_with_ws": "Pie "},
262
- # {"text": "Club", "text_with_ws": "Club "},
263
- # {"text": "and", "text_with_ws": "and "},
264
- # {"text": "Crawdaddy", "text_with_ws": "Crawdaddy "},
265
- # {"text": "Club", "text_with_ws": "Club"},
266
- # {"text": ".", "text_with_ws": "."},
267
- # ],
268
- # [
269
- # {"text": "As", "text_with_ws": "As "},
270
- # {"text": "blues", "text_with_ws": "blues "},
271
- # {"text": "and", "text_with_ws": "and "},
272
- # {"text": "folk", "text_with_ws": "folk "},
273
- # {"text": "thrived", "text_with_ws": "thrived"},
274
- # {"text": ",", "text_with_ws": ", "},
275
- # {"text": "bringing", "text_with_ws": "bringing "},
276
- # {"text": "Fairport", "text_with_ws": "Fairport "},
277
- # {"text": "Convention", "text_with_ws": "Convention "},
278
- # {"text": "and", "text_with_ws": "and "},
279
- # {"text": "Van", "text_with_ws": "Van "},
280
- # {"text": "Morrison", "text_with_ws": "Morrison"},
281
- # {"text": ",", "text_with_ws": ", "},
282
- # {"text": "so", "text_with_ws": "so "},
283
- # {"text": "other", "text_with_ws": "other "},
284
- # {"text": "genres", "text_with_ws": "genres "},
285
- # {"text": "began", "text_with_ws": "began "},
286
- # {"text": "to", "text_with_ws": "to "},
287
- # {"text": "appear", "text_with_ws": "appear"},
288
- # {"text": ".", "text_with_ws": ". "},
289
- # ],
290
- # [
291
- # {"text": "From", "text_with_ws": "From "},
292
- # {"text": "the", "text_with_ws": "the "},
293
- # {"text": "psychedelia", "text_with_ws": "psychedelia "},
294
- # {"text": "of", "text_with_ws": "of "},
295
- # {"text": "The", "text_with_ws": "The "},
296
- # {"text": "Bonzo", "text_with_ws": "Bonzo "},
297
- # {"text": "Dog", "text_with_ws": "Dog "},
298
- # {"text": "Doo", "text_with_ws": "Doo "},
299
- # {"text": "Dah", "text_with_ws": "Dah "},
300
- # {"text": "Band", "text_with_ws": "Band "},
301
- # {"text": "and", "text_with_ws": "and "},
302
- # {"text": "Bob", "text_with_ws": "Bob "},
303
- # {"text": "Kerr", "text_with_ws": "Kerr"},
304
- # {"text": "'s", "text_with_ws": "'s "},
305
- # {"text": "Whoopee", "text_with_ws": "Whoopee "},
306
- # {"text": "Band", "text_with_ws": "Band"},
307
- # {"text": ";", "text_with_ws": "; "},
308
- # {"text": "to", "text_with_ws": "to "},
309
- # {"text": "1960s", "text_with_ws": "1960s "},
310
- # {"text": "mod", "text_with_ws": "mod "},
311
- # {"text": "groups", "text_with_ws": "groups "},
312
- # {"text": "The", "text_with_ws": "The "},
313
- # {"text": "Pretty", "text_with_ws": "Pretty "},
314
- # {"text": "Things", "text_with_ws": "Things "},
315
- # {"text": "and", "text_with_ws": "and "},
316
- # {"text": "Nashville", "text_with_ws": "Nashville "},
317
- # {"text": "Teens", "text_with_ws": "Teens"},
318
- # {"text": ";", "text_with_ws": "; "},
319
- # {"text": "to", "text_with_ws": "to "},
320
- # {"text": "early", "text_with_ws": "early "},
321
- # {"text": "pub", "text_with_ws": "pub"},
322
- # {"text": "-", "text_with_ws": "-"},
323
- # {"text": "rock", "text_with_ws": "rock "},
324
- # {"text": "acts", "text_with_ws": "acts "},
325
- # {"text": "such", "text_with_ws": "such "},
326
- # {"text": "as", "text_with_ws": "as "},
327
- # {"text": "Dr.", "text_with_ws": "Dr. "},
328
- # {"text": "Feelgood", "text_with_ws": "Feelgood"},
329
- # {"text": ";", "text_with_ws": "; "},
330
- # {"text": "the", "text_with_ws": "the "},
331
- # {"text": "Half", "text_with_ws": "Half "},
332
- # {"text": "Moon", "text_with_ws": "Moon "},
333
- # {"text": "was", "text_with_ws": "was "},
334
- # {"text": "the", "text_with_ws": "the "},
335
- # {"text": "centre", "text_with_ws": "centre "},
336
- # {"text": "of", "text_with_ws": "of "},
337
- # {"text": "not", "text_with_ws": "not "},
338
- # {"text": "only", "text_with_ws": "only "},
339
- # {"text": "the", "text_with_ws": "the "},
340
- # {"text": "emerging", "text_with_ws": "emerging "},
341
- # {"text": "music", "text_with_ws": "music "},
342
- # {"text": "scenes", "text_with_ws": "scenes "},
343
- # {"text": "but", "text_with_ws": "but "},
344
- # {"text": "continued", "text_with_ws": "continued "},
345
- # {"text": "to", "text_with_ws": "to "},
346
- # {"text": "host", "text_with_ws": "host "},
347
- # {"text": "the", "text_with_ws": "the "},
348
- # {"text": "big", "text_with_ws": "big "},
349
- # {"text": "names", "text_with_ws": "names"},
350
- # {"text": ".", "text_with_ws": "."},
351
- # ],
352
- # [
353
- # {"text": "It", "text_with_ws": "It "},
354
- # {"text": "was", "text_with_ws": "was "},
355
- # {"text": "never", "text_with_ws": "never "},
356
- # {"text": "just", "text_with_ws": "just "},
357
- # {"text": "a", "text_with_ws": "a "},
358
- # {"text": '"', "text_with_ws": '"'},
359
- # {"text": "passing", "text_with_ws": "passing "},
360
- # {"text": "through", "text_with_ws": "through"},
361
- # {"text": '"', "text_with_ws": '" '},
362
- # {"text": "venue", "text_with_ws": "venue"},
363
- # {"text": ".", "text_with_ws": ". "},
364
- # ],
365
- # [
366
- # {"text": "Residencies", "text_with_ws": "Residencies "},
367
- # {"text": "at", "text_with_ws": "at "},
368
- # {"text": "the", "text_with_ws": "the "},
369
- # {"text": "Half", "text_with_ws": "Half "},
370
- # {"text": "Moon", "text_with_ws": "Moon "},
371
- # {"text": "have", "text_with_ws": "have "},
372
- # {"text": "included", "text_with_ws": "included "},
373
- # {"text": "Elvis", "text_with_ws": "Elvis "},
374
- # {"text": "Costello", "text_with_ws": "Costello "},
375
- # {"text": "(", "text_with_ws": "("},
376
- # {"text": "who", "text_with_ws": "who "},
377
- # {"text": "would", "text_with_ws": "would "},
378
- # {"text": "play", "text_with_ws": "play "},
379
- # {"text": "a", "text_with_ws": "a "},
380
- # {"text": "couple", "text_with_ws": "couple "},
381
- # {"text": "of", "text_with_ws": "of "},
382
- # {"text": "times", "text_with_ws": "times "},
383
- # {"text": "a", "text_with_ws": "a "},
384
- # {"text": "month", "text_with_ws": "month "},
385
- # {"text": "in", "text_with_ws": "in "},
386
- # {"text": "the", "text_with_ws": "the "},
387
- # {"text": "mid-1970s", "text_with_ws": "mid-1970s "},
388
- # {"text": "for", "text_with_ws": "for "},
389
- # {"text": "50", "text_with_ws": "50 "},
390
- # {"text": "pence", "text_with_ws": "pence "},
391
- # {"text": "and", "text_with_ws": "and "},
392
- # {"text": "a", "text_with_ws": "a "},
393
- # {"text": "plate", "text_with_ws": "plate "},
394
- # {"text": "of", "text_with_ws": "of "},
395
- # {"text": "sandwiches", "text_with_ws": "sandwiches"},
396
- # {"text": ")", "text_with_ws": ") "},
397
- # {"text": "and", "text_with_ws": "and "},
398
- # {"text": "Steve", "text_with_ws": "Steve "},
399
- # {"text": "Marriott", "text_with_ws": "Marriott "},
400
- # {"text": "of", "text_with_ws": "of "},
401
- # {"text": "Small", "text_with_ws": "Small "},
402
- # {"text": "Faces", "text_with_ws": "Faces "},
403
- # {"text": "fame", "text_with_ws": "fame"},
404
- # {"text": ".", "text_with_ws": ". "},
405
- # ],
406
- # [
407
- # {"text": "John", "text_with_ws": "John "},
408
- # {"text": "Martyn", "text_with_ws": "Martyn "},
409
- # {"text": "returned", "text_with_ws": "returned "},
410
- # {"text": "decades", "text_with_ws": "decades "},
411
- # {"text": "after", "text_with_ws": "after "},
412
- # {"text": "his", "text_with_ws": "his "},
413
- # {"text": "first", "text_with_ws": "first "},
414
- # {"text": "Half", "text_with_ws": "Half "},
415
- # {"text": "Moon", "text_with_ws": "Moon "},
416
- # {"text": "gigs", "text_with_ws": "gigs "},
417
- # {"text": "to", "text_with_ws": "to "},
418
- # {"text": "a", "text_with_ws": "a "},
419
- # {"text": "week", "text_with_ws": "week"},
420
- # {"text": "-", "text_with_ws": "-"},
421
- # {"text": "long", "text_with_ws": "long "},
422
- # {"text": "residency", "text_with_ws": "residency"},
423
- # {"text": ",", "text_with_ws": ", "},
424
- # {"text": "and", "text_with_ws": "and "},
425
- # {"text": "both", "text_with_ws": "both "},
426
- # {"text": "Tim", "text_with_ws": "Tim "},
427
- # {"text": "Rose", "text_with_ws": "Rose "},
428
- # {"text": "and", "text_with_ws": "and "},
429
- # {"text": "Roy", "text_with_ws": "Roy "},
430
- # {"text": "Harper", "text_with_ws": "Harper "},
431
- # {"text": "did", "text_with_ws": "did "},
432
- # {"text": "the", "text_with_ws": "the "},
433
- # {"text": "same", "text_with_ws": "same"},
434
- # {"text": ".", "text_with_ws": ". "},
435
- # ],
436
- # [
437
- # {"text": " ", "text_with_ws": " "},
438
- # {"text": "On", "text_with_ws": "On "},
439
- # {"text": "24", "text_with_ws": "24 "},
440
- # {"text": "July", "text_with_ws": "July "},
441
- # {"text": "2007", "text_with_ws": "2007"},
442
- # {"text": ",", "text_with_ws": ", "},
443
- # {"text": "Welsh", "text_with_ws": "Welsh"},
444
- # {"text": "-", "text_with_ws": "-"},
445
- # {"text": "language", "text_with_ws": "language "},
446
- # {"text": "folk", "text_with_ws": "folk "},
447
- # {"text": "guitarist", "text_with_ws": "guitarist "},
448
- # {"text": "Meic", "text_with_ws": "Meic "},
449
- # {"text": "Stevens", "text_with_ws": "Stevens "},
450
- # {"text": "performed", "text_with_ws": "performed "},
451
- # {"text": "his", "text_with_ws": "his "},
452
- # {"text": "first", "text_with_ws": "first "},
453
- # {"text": "London", "text_with_ws": "London "},
454
- # {"text": "gig", "text_with_ws": "gig "},
455
- # {"text": "in", "text_with_ws": "in "},
456
- # {"text": "over", "text_with_ws": "over "},
457
- # {"text": "30", "text_with_ws": "30 "},
458
- # {"text": "years", "text_with_ws": "years "},
459
- # {"text": "at", "text_with_ws": "at "},
460
- # {"text": "the", "text_with_ws": "the "},
461
- # {"text": "Half", "text_with_ws": "Half "},
462
- # {"text": "Moon", "text_with_ws": "Moon"},
463
- # {"text": ".", "text_with_ws": ". "},
464
- # ],
465
- # [
466
- # {"text": " ", "text_with_ws": " "},
467
- # {"text": "Other", "text_with_ws": "Other "},
468
- # {"text": "memorable", "text_with_ws": "memorable "},
469
- # {"text": "gigs", "text_with_ws": "gigs "},
470
- # {"text": "at", "text_with_ws": "at "},
471
- # {"text": "the", "text_with_ws": "the "},
472
- # {"text": "Half", "text_with_ws": "Half "},
473
- # {"text": "Moon", "text_with_ws": "Moon "},
474
- # {"text": "include", "text_with_ws": "include "},
475
- # {"text": "k.d", "text_with_ws": "k.d"},
476
- # {"text": ".", "text_with_ws": ". "},
477
- # ],
478
- # [
479
- # {"text": "lang", "text_with_ws": "lang"},
480
- # {"text": "'s", "text_with_ws": "'s "},
481
- # {"text": "first", "text_with_ws": "first "},
482
- # {"text": "UK", "text_with_ws": "UK "},
483
- # {"text": "appearance", "text_with_ws": "appearance"},
484
- # {"text": ",", "text_with_ws": ", "},
485
- # {"text": "Kate", "text_with_ws": "Kate "},
486
- # {"text": "Bush", "text_with_ws": "Bush"},
487
- # {"text": "'s", "text_with_ws": "'s "},
488
- # {"text": "first", "text_with_ws": "first "},
489
- # {"text": "ever", "text_with_ws": "ever "},
490
- # {"text": "public", "text_with_ws": "public "},
491
- # {"text": "performance", "text_with_ws": "performance "},
492
- # {"text": "and", "text_with_ws": "and "},
493
- # {"text": "a", "text_with_ws": "a "},
494
- # {"text": "surprise", "text_with_ws": "surprise "},
495
- # {"text": "appearance", "text_with_ws": "appearance "},
496
- # {"text": "by", "text_with_ws": "by "},
497
- # {"text": "Nick", "text_with_ws": "Nick "},
498
- # {"text": "Cave", "text_with_ws": "Cave"},
499
- # {"text": ".", "text_with_ws": ". "},
500
- # ],
501
- # [
502
- # {"text": "The", "text_with_ws": "The "},
503
- # {"text": "Hamsters", "text_with_ws": "Hamsters "},
504
- # {"text": "played", "text_with_ws": "played "},
505
- # {"text": "their", "text_with_ws": "their "},
506
- # {"text": "last", "text_with_ws": "last "},
507
- # {"text": "shows", "text_with_ws": "shows "},
508
- # {"text": "at", "text_with_ws": "at "},
509
- # {"text": "the", "text_with_ws": "the "},
510
- # {"text": "Half", "text_with_ws": "Half "},
511
- # {"text": "Moon", "text_with_ws": "Moon "},
512
- # {"text": "in", "text_with_ws": "in "},
513
- # {"text": "2012", "text_with_ws": "2012"},
514
- # {"text": ",", "text_with_ws": ", "},
515
- # {"text": "signing", "text_with_ws": "signing "},
516
- # {"text": "off", "text_with_ws": "off "},
517
- # {"text": "on", "text_with_ws": "on "},
518
- # {"text": "a", "text_with_ws": "a "},
519
- # {"text": "25", "text_with_ws": "25"},
520
- # {"text": "-", "text_with_ws": "-"},
521
- # {"text": "year", "text_with_ws": "year "},
522
- # {"text": "career", "text_with_ws": "career "},
523
- # {"text": "with", "text_with_ws": "with "},
524
- # {"text": "five", "text_with_ws": "five "},
525
- # {"text": "shows", "text_with_ws": "shows "},
526
- # {"text": "over", "text_with_ws": "over "},
527
- # {"text": "one", "text_with_ws": "one "},
528
- # {"text": "weekend", "text_with_ws": "weekend"},
529
- # {"text": ".", "text_with_ws": "."},
530
- # ],
531
- # [
532
- # {"text": "The", "text_with_ws": "The "},
533
- # {"text": "Half", "text_with_ws": "Half "},
534
- # {"text": "Moon", "text_with_ws": "Moon "},
535
- # {"text": "has", "text_with_ws": "has "},
536
- # {"text": "always", "text_with_ws": "always "},
537
- # {"text": "been", "text_with_ws": "been "},
538
- # {"text": "synonymous", "text_with_ws": "synonymous "},
539
- # {"text": "with", "text_with_ws": "with "},
540
- # {"text": "The", "text_with_ws": "The "},
541
- # {"text": "Rolling", "text_with_ws": "Rolling "},
542
- # {"text": "Stones", "text_with_ws": "Stones"},
543
- # {"text": ",", "text_with_ws": ", "},
544
- # {"text": "whose", "text_with_ws": "whose "},
545
- # {"text": "most", "text_with_ws": "most "},
546
- # {"text": "recent", "text_with_ws": "recent "},
547
- # {"text": "visit", "text_with_ws": "visit "},
548
- # {"text": "was", "text_with_ws": "was "},
549
- # {"text": "a", "text_with_ws": "a "},
550
- # {"text": "private", "text_with_ws": "private "},
551
- # {"text": "event", "text_with_ws": "event "},
552
- # {"text": "held", "text_with_ws": "held "},
553
- # {"text": "in", "text_with_ws": "in "},
554
- # {"text": "May", "text_with_ws": "May "},
555
- # {"text": "2000", "text_with_ws": "2000"},
556
- # {"text": ".", "text_with_ws": ". "},
557
- # ],
558
- # [
559
- # {"text": "As", "text_with_ws": "As "},
560
- # {"text": "well", "text_with_ws": "well "},
561
- # {"text": "as", "text_with_ws": "as "},
562
- # {"text": "performing", "text_with_ws": "performing "},
563
- # {"text": "as", "text_with_ws": "as "},
564
- # {"text": "the", "text_with_ws": "the "},
565
- # {"text": "band", "text_with_ws": "band"},
566
- # {"text": ",", "text_with_ws": ", "},
567
- # {"text": "individual", "text_with_ws": "individual "},
568
- # {"text": "members", "text_with_ws": "members "},
569
- # {"text": "have", "text_with_ws": "have "},
570
- # {"text": "appeared", "text_with_ws": "appeared "},
571
- # {"text": "here", "text_with_ws": "here "},
572
- # {"text": "in", "text_with_ws": "in "},
573
- # {"text": "various", "text_with_ws": "various "},
574
- # {"text": "side", "text_with_ws": "side"},
575
- # {"text": "-", "text_with_ws": "-"},
576
- # {"text": "projects", "text_with_ws": "projects"},
577
- # {"text": ",", "text_with_ws": ", "},
578
- # {"text": "and", "text_with_ws": "and "},
579
- # {"text": "have", "text_with_ws": "have "},
580
- # {"text": "also", "text_with_ws": "also "},
581
- # {"text": "used", "text_with_ws": "used "},
582
- # {"text": "the", "text_with_ws": "the "},
583
- # {"text": "venue", "text_with_ws": "venue "},
584
- # {"text": "for", "text_with_ws": "for "},
585
- # {"text": "rehearsal", "text_with_ws": "rehearsal "},
586
- # {"text": "space", "text_with_ws": "space"},
587
- # {"text": ".", "text_with_ws": ". "},
588
- # ],
589
- # [
590
- # {"text": "In", "text_with_ws": "In "},
591
- # {"text": "January", "text_with_ws": "January "},
592
- # {"text": "2010", "text_with_ws": "2010"},
593
- # {"text": ",", "text_with_ws": ", "},
594
- # {"text": "the", "text_with_ws": "the "},
595
- # {"text": "Half", "text_with_ws": "Half "},
596
- # {"text": "Moon", "text_with_ws": "Moon "},
597
- # {"text": "almost", "text_with_ws": "almost "},
598
- # {"text": "closed", "text_with_ws": "closed "},
599
- # {"text": "due", "text_with_ws": "due "},
600
- # {"text": "to", "text_with_ws": "to "},
601
- # {"text": "failing", "text_with_ws": "failing "},
602
- # {"text": "sales", "text_with_ws": "sales"},
603
- # {"text": ",", "text_with_ws": ", "},
604
- # {"text": "rising", "text_with_ws": "rising "},
605
- # {"text": "rates", "text_with_ws": "rates "},
606
- # {"text": "and", "text_with_ws": "and "},
607
- # {"text": "the", "text_with_ws": "the "},
608
- # {"text": "recession", "text_with_ws": "recession"},
609
- # {"text": ",", "text_with_ws": ", "},
610
- # {"text": "but", "text_with_ws": "but "},
611
- # {"text": "they", "text_with_ws": "they "},
612
- # {"text": "received", "text_with_ws": "received "},
613
- # {"text": "hundreds", "text_with_ws": "hundreds "},
614
- # {"text": "of", "text_with_ws": "of "},
615
- # {"text": "signatures", "text_with_ws": "signatures "},
616
- # {"text": "and", "text_with_ws": "and "},
617
- # {"text": "a", "text_with_ws": "a "},
618
- # {"text": "Facebook", "text_with_ws": "Facebook "},
619
- # {"text": "campaign", "text_with_ws": "campaign "},
620
- # {"text": "of", "text_with_ws": "of "},
621
- # {"text": "6,500", "text_with_ws": "6,500 "},
622
- # {"text": "people", "text_with_ws": "people"},
623
- # {"text": ".", "text_with_ws": ". "},
624
- # ],
625
- # [
626
- # {"text": "Musicians", "text_with_ws": "Musicians "},
627
- # {"text": "such", "text_with_ws": "such "},
628
- # {"text": "as", "text_with_ws": "as "},
629
- # {"text": "The", "text_with_ws": "The "},
630
- # {"text": "X", "text_with_ws": "X "},
631
- # {"text": "Factor", "text_with_ws": "Factor "},
632
- # {"text": "finalist", "text_with_ws": "finalist "},
633
- # {"text": "Jamie", "text_with_ws": "Jamie "},
634
- # {"text": "Archer", "text_with_ws": "Archer"},
635
- # {"text": ",", "text_with_ws": ", "},
636
- # {"text": "Eddi", "text_with_ws": "Eddi "},
637
- # {"text": "Reader", "text_with_ws": "Reader "},
638
- # {"text": "and", "text_with_ws": "and "},
639
- # {"text": "Simon", "text_with_ws": "Simon "},
640
- # {"text": "Fowler", "text_with_ws": "Fowler "},
641
- # {"text": "supported", "text_with_ws": "supported "},
642
- # {"text": "the", "text_with_ws": "the "},
643
- # {"text": "petition", "text_with_ws": "petition "},
644
- # {"text": "as", "text_with_ws": "as "},
645
- # {"text": "well", "text_with_ws": "well"},
646
- # {"text": ".", "text_with_ws": ". "},
647
- # ],
648
- # [
649
- # {"text": "As", "text_with_ws": "As "},
650
- # {"text": "part", "text_with_ws": "part "},
651
- # {"text": "of", "text_with_ws": "of "},
652
- # {"text": "its", "text_with_ws": "its "},
653
- # {"text": "revival", "text_with_ws": "revival"},
654
- # {"text": ",", "text_with_ws": ", "},
655
- # {"text": "the", "text_with_ws": "the "},
656
- # {"text": "Half", "text_with_ws": "Half "},
657
- # {"text": "Moon", "text_with_ws": "Moon "},
658
- # {"text": "started", "text_with_ws": "started "},
659
- # {"text": "serving", "text_with_ws": "serving "},
660
- # {"text": "food", "text_with_ws": "food"},
661
- # {"text": ".", "text_with_ws": "."},
662
- # ],
663
- # [
664
- # {"text": "In", "text_with_ws": "In "},
665
- # {"text": "2012", "text_with_ws": "2012"},
666
- # {"text": ",", "text_with_ws": ", "},
667
- # {"text": "the", "text_with_ws": "the "},
668
- # {"text": "Half", "text_with_ws": "Half "},
669
- # {"text": "Moon", "text_with_ws": "Moon "},
670
- # {"text": "was", "text_with_ws": "was "},
671
- # {"text": "bought", "text_with_ws": "bought "},
672
- # {"text": "by", "text_with_ws": "by "},
673
- # {"text": "Geronimo", "text_with_ws": "Geronimo "},
674
- # {"text": "Inns", "text_with_ws": "Inns"},
675
- # {"text": ",", "text_with_ws": ", "},
676
- # {"text": "whose", "text_with_ws": "whose "},
677
- # {"text": "parent", "text_with_ws": "parent "},
678
- # {"text": "company", "text_with_ws": "company "},
679
- # {"text": "is", "text_with_ws": "is "},
680
- # {"text": "Young", "text_with_ws": "Young "},
681
- # {"text": "&", "text_with_ws": "& "},
682
- # {"text": "Co.", "text_with_ws": "Co."},
683
- # ],
684
- # ],
685
- # [
686
- # [
687
- # {"text": "Artists", "text_with_ws": "Artists "},
688
- # {"text": "who", "text_with_ws": "who "},
689
- # {"text": "have", "text_with_ws": "have "},
690
- # {"text": "performed", "text_with_ws": "performed "},
691
- # {"text": "or", "text_with_ws": "or "},
692
- # {"text": "recorded", "text_with_ws": "recorded "},
693
- # {"text": "at", "text_with_ws": "at "},
694
- # {"text": "the", "text_with_ws": "the "},
695
- # {"text": "venue", "text_with_ws": "venue "},
696
- # {"text": "since", "text_with_ws": "since "},
697
- # {"text": "the", "text_with_ws": "the "},
698
- # {"text": "mid-1960s", "text_with_ws": "mid-1960s "},
699
- # {"text": "include", "text_with_ws": "include "},
700
- # {"text": "the", "text_with_ws": "the "},
701
- # {"text": "Rolling", "text_with_ws": "Rolling "},
702
- # {"text": "Stones", "text_with_ws": "Stones"},
703
- # {"text": ",", "text_with_ws": ", "},
704
- # {"text": "The", "text_with_ws": "The "},
705
- # {"text": "Who", "text_with_ws": "Who"},
706
- # {"text": ",", "text_with_ws": ", "},
707
- # {"text": "The", "text_with_ws": "The "},
708
- # {"text": "Small", "text_with_ws": "Small "},
709
- # {"text": "Faces", "text_with_ws": "Faces"},
710
- # {"text": ",", "text_with_ws": ", "},
711
- # {"text": "Kasabian", "text_with_ws": "Kasabian"},
712
- # {"text": ",", "text_with_ws": ", "},
713
- # {"text": "Sisteray", "text_with_ws": "Sisteray"},
714
- # {"text": ",", "text_with_ws": ", "},
715
- # {"text": "Chris", "text_with_ws": "Chris "},
716
- # {"text": "Bell", "text_with_ws": "Bell"},
717
- # {"text": ",", "text_with_ws": ", "},
718
- # {"text": "Ralph", "text_with_ws": "Ralph "},
719
- # {"text": "McTell", "text_with_ws": "McTell"},
720
- # {"text": ",", "text_with_ws": ", "},
721
- # {"text": "GoodLuck", "text_with_ws": "GoodLuck"},
722
- # {"text": ",", "text_with_ws": ", "},
723
- # {"text": "John", "text_with_ws": "John "},
724
- # {"text": "Martyn", "text_with_ws": "Martyn"},
725
- # {"text": ",", "text_with_ws": ", "},
726
- # {"text": "John", "text_with_ws": "John "},
727
- # {"text": "Mayall", "text_with_ws": "Mayall"},
728
- # {"text": "'s", "text_with_ws": "'s "},
729
- # {"text": "Bluesbreakers", "text_with_ws": "Bluesbreakers"},
730
- # {"text": ",", "text_with_ws": ", "},
731
- # {"text": "Alexis", "text_with_ws": "Alexis "},
732
- # {"text": "Korner", "text_with_ws": "Korner"},
733
- # {"text": ",", "text_with_ws": ", "},
734
- # {"text": "The", "text_with_ws": "The "},
735
- # {"text": "Yardbirds", "text_with_ws": "Yardbirds"},
736
- # {"text": ",", "text_with_ws": ", "},
737
- # {"text": "Bob", "text_with_ws": "Bob "},
738
- # {"text": "Kerr", "text_with_ws": "Kerr"},
739
- # {"text": "'s", "text_with_ws": "'s "},
740
- # {"text": "Whoopee", "text_with_ws": "Whoopee "},
741
- # {"text": "Band", "text_with_ws": "Band"},
742
- # {"text": ",", "text_with_ws": ", "},
743
- # {"text": "Morrissey", "text_with_ws": "Morrissey"},
744
- # {"text": "–", "text_with_ws": "–"},
745
- # {"text": "Mullen", "text_with_ws": "Mullen "},
746
- # {"text": "–", "text_with_ws": "– "},
747
- # {"text": "who", "text_with_ws": "who "},
748
- # {"text": "had", "text_with_ws": "had "},
749
- # {"text": "a", "text_with_ws": "a "},
750
- # {"text": "residency", "text_with_ws": "residency "},
751
- # {"text": "there", "text_with_ws": "there "},
752
- # {"text": "of", "text_with_ws": "of "},
753
- # {"text": "several", "text_with_ws": "several "},
754
- # {"text": "years", "text_with_ws": "years"},
755
- # {"text": "'", "text_with_ws": "' "},
756
- # {"text": "standing", "text_with_ws": "standing"},
757
- # {"text": ",", "text_with_ws": ", "},
758
- # {"text": "Rocket", "text_with_ws": "Rocket "},
759
- # {"text": "88", "text_with_ws": "88"},
760
- # {"text": ",", "text_with_ws": ", "},
761
- # {"text": "Fairport", "text_with_ws": "Fairport "},
762
- # {"text": "Convention", "text_with_ws": "Convention"},
763
- # {"text": ",", "text_with_ws": ", "},
764
- # {"text": "Bonzo", "text_with_ws": "Bonzo "},
765
- # {"text": "Dog", "text_with_ws": "Dog "},
766
- # {"text": "Doo", "text_with_ws": "Doo"},
767
- # {"text": "-", "text_with_ws": "-"},
768
- # {"text": "Dah", "text_with_ws": "Dah "},
769
- # {"text": "Band", "text_with_ws": "Band"},
770
- # {"text": ",", "text_with_ws": ", "},
771
- # {"text": "Roy", "text_with_ws": "Roy "},
772
- # {"text": "Harper", "text_with_ws": "Harper"},
773
- # {"text": ",", "text_with_ws": ", "},
774
- # {"text": "Van", "text_with_ws": "Van "},
775
- # {"text": "Morrison", "text_with_ws": "Morrison"},
776
- # {"text": ",", "text_with_ws": ","},
777
- # {"text": "Man", "text_with_ws": "Man"},
778
- # {"text": ",", "text_with_ws": ", "},
779
- # {"text": "Danny", "text_with_ws": "Danny "},
780
- # {"text": "Thompson", "text_with_ws": "Thompson"},
781
- # {"text": ",", "text_with_ws": ", "},
782
- # {"text": "Dr.", "text_with_ws": "Dr. "},
783
- # {"text": "Feelgood", "text_with_ws": "Feelgood"},
784
- # {"text": ",", "text_with_ws": ", "},
785
- # {"text": "Elvis", "text_with_ws": "Elvis "},
786
- # {"text": "Costello", "text_with_ws": "Costello"},
787
- # {"text": ",", "text_with_ws": ", "},
788
- # {"text": "Meic", "text_with_ws": "Meic "},
789
- # {"text": "Stevens", "text_with_ws": "Stevens"},
790
- # {"text": ",", "text_with_ws": ", "},
791
- # {"text": "Finley", "text_with_ws": "Finley "},
792
- # {"text": "Quaye", "text_with_ws": "Quaye"},
793
- # {"text": ",", "text_with_ws": ", "},
794
- # {"text": "I", "text_with_ws": "I "},
795
- # {"text": "Am", "text_with_ws": "Am "},
796
- # {"text": "Kloot", "text_with_ws": "Kloot"},
797
- # {"text": ",", "text_with_ws": ", "},
798
- # {"text": "Starlite", "text_with_ws": "Starlite "},
799
- # {"text": "Campbell", "text_with_ws": "Campbell "},
800
- # {"text": "Band", "text_with_ws": "Band"},
801
- # {"text": ",", "text_with_ws": ", "},
802
- # {"text": "Beverley", "text_with_ws": "Beverley "},
803
- # {"text": "Craven", "text_with_ws": "Craven"},
804
- # {"text": ",", "text_with_ws": ", "},
805
- # {"text": "Bo", "text_with_ws": "Bo "},
806
- # {"text": "Diddley", "text_with_ws": "Diddley"},
807
- # {"text": ",", "text_with_ws": ", "},
808
- # {"text": "John", "text_with_ws": "John "},
809
- # {"text": "Otway", "text_with_ws": "Otway"},
810
- # {"text": ",", "text_with_ws": ", "},
811
- # {"text": "Tim", "text_with_ws": "Tim "},
812
- # {"text": "Rose", "text_with_ws": "Rose"},
813
- # {"text": ",", "text_with_ws": ", "},
814
- # {"text": "Amy", "text_with_ws": "Amy "},
815
- # {"text": "McDonald", "text_with_ws": "McDonald"},
816
- # {"text": ",", "text_with_ws": ", "},
817
- # {"text": "Catfish", "text_with_ws": "Catfish "},
818
- # {"text": "Keith", "text_with_ws": "Keith"},
819
- # {"text": ",", "text_with_ws": ", "},
820
- # {"text": "The", "text_with_ws": "The "},
821
- # {"text": "MonaLisa", "text_with_ws": "MonaLisa "},
822
- # {"text": "Twins", "text_with_ws": "Twins"},
823
- # {"text": ",", "text_with_ws": ", "},
824
- # {"text": "as", "text_with_ws": "as "},
825
- # {"text": "well", "text_with_ws": "well "},
826
- # {"text": "as", "text_with_ws": "as "},
827
- # {"text": "k.d", "text_with_ws": "k.d"},
828
- # {"text": ".", "text_with_ws": ". "},
829
- # ],
830
- # [
831
- # {"text": "lang", "text_with_ws": "lang"},
832
- # {"text": "'s", "text_with_ws": "'s "},
833
- # {"text": "first", "text_with_ws": "first "},
834
- # {"text": "UK", "text_with_ws": "UK "},
835
- # {"text": "appearance", "text_with_ws": "appearance"},
836
- # {"text": ",", "text_with_ws": ", "},
837
- # {"text": "and", "text_with_ws": "and "},
838
- # {"text": "Kate", "text_with_ws": "Kate "},
839
- # {"text": "Bush", "text_with_ws": "Bush"},
840
- # {"text": "'s", "text_with_ws": "'s "},
841
- # {"text": "first", "text_with_ws": "first "},
842
- # {"text": "public", "text_with_ws": "public "},
843
- # {"text": "performance", "text_with_ws": "performance"},
844
- # {"text": ".", "text_with_ws": "."},
845
- # ],
846
- # [
847
- # {"text": "It", "text_with_ws": "It "},
848
- # {"text": "has", "text_with_ws": "has "},
849
- # {"text": "also", "text_with_ws": "also "},
850
- # {"text": "hosted", "text_with_ws": "hosted "},
851
- # {"text": "comedy", "text_with_ws": "comedy"},
852
- # {"text": ",", "text_with_ws": ", "},
853
- # {"text": "including", "text_with_ws": "including "},
854
- # {"text": "Billy", "text_with_ws": "Billy "},
855
- # {"text": "Connolly", "text_with_ws": "Connolly"},
856
- # {"text": ",", "text_with_ws": ", "},
857
- # {"text": "Andy", "text_with_ws": "Andy "},
858
- # {"text": "Parsons", "text_with_ws": "Parsons"},
859
- # {"text": ",", "text_with_ws": ", "},
860
- # {"text": "Harry", "text_with_ws": "Harry "},
861
- # {"text": "Hill", "text_with_ws": "Hill"},
862
- # {"text": ",", "text_with_ws": ", "},
863
- # {"text": "Rufus", "text_with_ws": "Rufus "},
864
- # {"text": "Hound", "text_with_ws": "Hound"},
865
- # {"text": ",", "text_with_ws": ", "},
866
- # {"text": "Shappi", "text_with_ws": "Shappi "},
867
- # {"text": "Khorsandi", "text_with_ws": "Khorsandi"},
868
- # {"text": ",", "text_with_ws": ", "},
869
- # {"text": "Norman", "text_with_ws": "Norman "},
870
- # {"text": "Lovett", "text_with_ws": "Lovett"},
871
- # {"text": ",", "text_with_ws": ", "},
872
- # {"text": "Bob", "text_with_ws": "Bob "},
873
- # {"text": "Mills", "text_with_ws": "Mills"},
874
- # {"text": ",", "text_with_ws": ", "},
875
- # {"text": "Milton", "text_with_ws": "Milton "},
876
- # {"text": "Jones", "text_with_ws": "Jones"},
877
- # {"text": ",", "text_with_ws": ", "},
878
- # {"text": "Al", "text_with_ws": "Al "},
879
- # {"text": "Murray", "text_with_ws": "Murray"},
880
- # {"text": ",", "text_with_ws": ", "},
881
- # {"text": "Stewart", "text_with_ws": "Stewart "},
882
- # {"text": "Lee", "text_with_ws": "Lee"},
883
- # {"text": ",", "text_with_ws": ", "},
884
- # {"text": "Richard", "text_with_ws": "Richard "},
885
- # {"text": "Herring", "text_with_ws": "Herring"},
886
- # {"text": ",", "text_with_ws": ", "},
887
- # {"text": "Jack", "text_with_ws": "Jack "},
888
- # {"text": "Whitehall", "text_with_ws": "Whitehall"},
889
- # {"text": ",", "text_with_ws": ", "},
890
- # {"text": "Alistair", "text_with_ws": "Alistair "},
891
- # {"text": "McGowan", "text_with_ws": "McGowan"},
892
- # {"text": ",", "text_with_ws": ", "},
893
- # {"text": "Katherine", "text_with_ws": "Katherine "},
894
- # {"text": "Ryan", "text_with_ws": "Ryan"},
895
- # {"text": ",", "text_with_ws": ", "},
896
- # {"text": "Cardinal", "text_with_ws": "Cardinal "},
897
- # {"text": "Burns", "text_with_ws": "Burns"},
898
- # {"text": ",", "text_with_ws": ", "},
899
- # {"text": "Reginald", "text_with_ws": "Reginald "},
900
- # {"text": "D.", "text_with_ws": "D. "},
901
- # {"text": "Hunter", "text_with_ws": "Hunter"},
902
- # {"text": ",", "text_with_ws": ", "},
903
- # {"text": "Stewart", "text_with_ws": "Stewart "},
904
- # {"text": "Francis", "text_with_ws": "Francis"},
905
- # {"text": ",", "text_with_ws": ", "},
906
- # {"text": "Bridget", "text_with_ws": "Bridget "},
907
- # {"text": "Christie", "text_with_ws": "Christie"},
908
- # {"text": ",", "text_with_ws": ", "},
909
- # {"text": "Josh", "text_with_ws": "Josh "},
910
- # {"text": "Widdicombe", "text_with_ws": "Widdicombe"},
911
- # {"text": ",", "text_with_ws": ", "},
912
- # {"text": "Sara", "text_with_ws": "Sara "},
913
- # {"text": "Pascoe", "text_with_ws": "Pascoe"},
914
- # {"text": ",", "text_with_ws": ", "},
915
- # {"text": "Rob", "text_with_ws": "Rob "},
916
- # {"text": "Beckett", "text_with_ws": "Beckett"},
917
- # {"text": ",", "text_with_ws": ", "},
918
- # {"text": "Sean", "text_with_ws": "Sean "},
919
- # {"text": "Hughes", "text_with_ws": "Hughes"},
920
- # {"text": ",", "text_with_ws": ", "},
921
- # {"text": "Kevin", "text_with_ws": "Kevin "},
922
- # {"text": "Eldon", "text_with_ws": "Eldon"},
923
- # {"text": ",", "text_with_ws": ", "},
924
- # {"text": "Henning", "text_with_ws": "Henning "},
925
- # {"text": "Wehn", "text_with_ws": "Wehn"},
926
- # {"text": ",", "text_with_ws": ", "},
927
- # {"text": "Hal", "text_with_ws": "Hal "},
928
- # {"text": "Cruttenden", "text_with_ws": "Cruttenden"},
929
- # {"text": ",", "text_with_ws": ", "},
930
- # {"text": "Holly", "text_with_ws": "Holly "},
931
- # {"text": "Walsh", "text_with_ws": "Walsh"},
932
- # {"text": ",", "text_with_ws": ", "},
933
- # {"text": "Danny", "text_with_ws": "Danny "},
934
- # {"text": "Bhoy", "text_with_ws": "Bhoy"},
935
- # {"text": ",", "text_with_ws": ", "},
936
- # {"text": "Aisling", "text_with_ws": "Aisling "},
937
- # {"text": "Bea", "text_with_ws": "Bea "},
938
- # {"text": "and", "text_with_ws": "and "},
939
- # {"text": "James", "text_with_ws": "James "},
940
- # {"text": "Acaster", "text_with_ws": "Acaster"},
941
- # {"text": ".", "text_with_ws": "."},
942
- # ],
943
- # ],
944
- # [
945
- # [
946
- # {"text": "Live", "text_with_ws": "Live "},
947
- # {"text": "at", "text_with_ws": "at "},
948
- # {"text": "Half", "text_with_ws": "Half "},
949
- # {"text": "Moon", "text_with_ws": "Moon "},
950
- # {"text": "Putney", "text_with_ws": "Putney "},
951
- # {"text": "–", "text_with_ws": "– "},
952
- # {"text": "The", "text_with_ws": "The "},
953
- # {"text": "De", "text_with_ws": "De "},
954
- # {"text": "Luxe", "text_with_ws": "Luxe "},
955
- # {"text": "Blues", "text_with_ws": "Blues "},
956
- # {"text": "Band", "text_with_ws": "Band "},
957
- # {"text": "(", "text_with_ws": "("},
958
- # {"text": "1981)An", "text_with_ws": "1981)An "},
959
- # {"text": "Evening", "text_with_ws": "Evening "},
960
- # {"text": "with", "text_with_ws": "with "},
961
- # {"text": "Meic", "text_with_ws": "Meic "},
962
- # {"text": "Stevens", "text_with_ws": "Stevens"},
963
- # {"text": ":", "text_with_ws": ": "},
964
- # {"text": "Recorded", "text_with_ws": "Recorded "},
965
- # {"text": "Live", "text_with_ws": "Live "},
966
- # {"text": "in", "text_with_ws": "in "},
967
- # {"text": "London", "text_with_ws": "London "},
968
- # {"text": "–", "text_with_ws": "– "},
969
- # {"text": "Meic", "text_with_ws": "Meic "},
970
- # {"text": "Stevens", "text_with_ws": "Stevens "},
971
- # {"text": "(", "text_with_ws": "("},
972
- # {"text": "2007", "text_with_ws": "2007"},
973
- # {"text": ")", "text_with_ws": ") "},
974
- # {"text": "(", "text_with_ws": "("},
975
- # {"text": "Sunbeam", "text_with_ws": "Sunbeam "},
976
- # {"text": "Records", "text_with_ws": "Records"},
977
- # {"text": ",", "text_with_ws": ", "},
978
- # {"text": "SBRCD5033)Live", "text_with_ws": "SBRCD5033)Live "},
979
- # {"text": "at", "text_with_ws": "at "},
980
- # {"text": "the", "text_with_ws": "the "},
981
- # {"text": "Half", "text_with_ws": "Half "},
982
- # {"text": "Moon", "text_with_ws": "Moon "},
983
- # {"text": "–", "text_with_ws": "– "},
984
- # {"text": "Catfish", "text_with_ws": "Catfish "},
985
- # {"text": "Keith", "text_with_ws": "Keith "},
986
- # {"text": "(", "text_with_ws": "("},
987
- # {"text": "2009", "text_with_ws": "2009"},
988
- # {"text": ")", "text_with_ws": ") "},
989
- # {"text": "(", "text_with_ws": "("},
990
- # {"text": "Fish", "text_with_ws": "Fish "},
991
- # {"text": "Tail", "text_with_ws": "Tail "},
992
- # {"text": "Records)Remember", "text_with_ws": "Records)Remember"},
993
- # {"text": ",", "text_with_ws": ", "},
994
- # {"text": "on", "text_with_ws": "on "},
995
- # {"text": "stage", "text_with_ws": "stage "},
996
- # {"text": "at", "text_with_ws": "at "},
997
- # {"text": "The", "text_with_ws": "The "},
998
- # {"text": "Half", "text_with_ws": "Half "},
999
- # {"text": "Moon", "text_with_ws": "Moon "},
1000
- # {"text": "–", "text_with_ws": "– "},
1001
- # {"text": "Latin", "text_with_ws": "Latin "},
1002
- # {"text": "Quarter", "text_with_ws": "Quarter "},
1003
- # {"text": "(", "text_with_ws": "("},
1004
- # {"text": "2023", "text_with_ws": "2023"},
1005
- # {"text": ")", "text_with_ws": ") "},
1006
- # {"text": "(", "text_with_ws": "("},
1007
- # {"text": "Westpark", "text_with_ws": "Westpark "},
1008
- # {"text": "Music", "text_with_ws": "Music"},
1009
- # {"text": ")", "text_with_ws": ")"},
1010
- # ]
1011
- # ],
1012
- # [
1013
- # [
1014
- # {"text": "The", "text_with_ws": "The "},
1015
- # {"text": "pub", "text_with_ws": "pub "},
1016
- # {"text": "is", "text_with_ws": "is "},
1017
- # {"text": "served", "text_with_ws": "served "},
1018
- # {"text": "by", "text_with_ws": "by "},
1019
- # {"text": "Transport", "text_with_ws": "Transport "},
1020
- # {"text": "for", "text_with_ws": "for "},
1021
- # {"text": "London", "text_with_ws": "London "},
1022
- # {"text": "buses", "text_with_ws": "buses "},
1023
- # {"text": "22", "text_with_ws": "22"},
1024
- # {"text": ",", "text_with_ws": ", "},
1025
- # {"text": "265", "text_with_ws": "265"},
1026
- # {"text": ",", "text_with_ws": ", "},
1027
- # {"text": "378", "text_with_ws": "378"},
1028
- # {"text": ",", "text_with_ws": ", "},
1029
- # {"text": "485", "text_with_ws": "485 "},
1030
- # {"text": "which", "text_with_ws": "which "},
1031
- # {"text": "stop", "text_with_ws": "stop "},
1032
- # {"text": "on", "text_with_ws": "on "},
1033
- # {"text": "the", "text_with_ws": "the "},
1034
- # {"text": "Lower", "text_with_ws": "Lower "},
1035
- # {"text": "Richmond", "text_with_ws": "Richmond "},
1036
- # {"text": "road", "text_with_ws": "road"},
1037
- # {"text": ".", "text_with_ws": ". "},
1038
- # ],
1039
- # [
1040
- # {"text": "Putney", "text_with_ws": "Putney "},
1041
- # {"text": "Bridge", "text_with_ws": "Bridge "},
1042
- # {"text": "tube", "text_with_ws": "tube "},
1043
- # {"text": "station", "text_with_ws": "station "},
1044
- # {"text": "(", "text_with_ws": "("},
1045
- # {"text": "District", "text_with_ws": "District "},
1046
- # {"text": "line", "text_with_ws": "line"},
1047
- # {"text": ")", "text_with_ws": ") "},
1048
- # {"text": "is", "text_with_ws": "is "},
1049
- # {"text": "a", "text_with_ws": "a "},
1050
- # {"text": "12", "text_with_ws": "12 "},
1051
- # {"text": "minute", "text_with_ws": "minute "},
1052
- # {"text": "walk", "text_with_ws": "walk "},
1053
- # {"text": "over", "text_with_ws": "over "},
1054
- # {"text": "Putney", "text_with_ws": "Putney "},
1055
- # {"text": "Bridge", "text_with_ws": "Bridge "},
1056
- # {"text": "and", "text_with_ws": "and "},
1057
- # {"text": "Putney", "text_with_ws": "Putney "},
1058
- # {"text": "railway", "text_with_ws": "railway "},
1059
- # {"text": "station", "text_with_ws": "station "},
1060
- # {"text": "(", "text_with_ws": "("},
1061
- # {"text": "Southwestern", "text_with_ws": "Southwestern "},
1062
- # {"text": "Railway", "text_with_ws": "Railway"},
1063
- # {"text": ")", "text_with_ws": ") "},
1064
- # {"text": "is", "text_with_ws": "is "},
1065
- # {"text": "a", "text_with_ws": "a "},
1066
- # {"text": "12", "text_with_ws": "12 "},
1067
- # {"text": "minute", "text_with_ws": "minute "},
1068
- # {"text": "walk", "text_with_ws": "walk "},
1069
- # {"text": "up", "text_with_ws": "up "},
1070
- # {"text": "Putney", "text_with_ws": "Putney "},
1071
- # {"text": "High", "text_with_ws": "High "},
1072
- # {"text": "Street", "text_with_ws": "Street"},
1073
- # {"text": ".", "text_with_ws": "."},
1074
- # ],
1075
- # [
1076
- # {"text": "The", "text_with_ws": "The "},
1077
- # {"text": "Santander", "text_with_ws": "Santander "},
1078
- # {"text": "Cycles", "text_with_ws": "Cycles "},
1079
- # {"text": "Putney", "text_with_ws": "Putney "},
1080
- # {"text": "Pier", "text_with_ws": "Pier "},
1081
- # {"text": "docking", "text_with_ws": "docking "},
1082
- # {"text": "station", "text_with_ws": "station "},
1083
- # {"text": "is", "text_with_ws": "is "},
1084
- # {"text": "a", "text_with_ws": "a "},
1085
- # {"text": "three", "text_with_ws": "three "},
1086
- # {"text": "minute", "text_with_ws": "minute "},
1087
- # {"text": "walk", "text_with_ws": "walk"},
1088
- # {"text": ".", "text_with_ws": "."},
1089
- # ],
1090
- # ],
1091
- # [
1092
- # [
1093
- # {"text": "Official", "text_with_ws": "Official "},
1094
- # {"text": "website", "text_with_ws": "website"},
1095
- # ]
1096
- # ],
1097
- # ],
1098
- # "top_spans": [
1099
- # [0, 2],
1100
- # [4, 13],
1101
- # [11, 13],
1102
- # [13, 13],
1103
- # [15, 15],
1104
- # [17, 26],
1105
- # [17, 36],
1106
- # [19, 21],
1107
- # [19, 26],
1108
- # [30, 30],
1109
- # [31, 32],
1110
- # [33, 34],
1111
- # [36, 36],
1112
- # [38, 39],
1113
- # [42, 49],
1114
- # [42, 56],
1115
- # [46, 49],
1116
- # [52, 56],
1117
- # [53, 53],
1118
- # [58, 60],
1119
- # [62, 74],
1120
- # [64, 65],
1121
- # [64, 74],
1122
- # [77, 79],
1123
- # [81, 88],
1124
- # [87, 88],
1125
- # [90, 90],
1126
- # [94, 96],
1127
- # [94, 97],
1128
- # [94, 100],
1129
- # [99, 100],
1130
- # [102, 103],
1131
- # [105, 105],
1132
- # [106, 107],
1133
- # [108, 109],
1134
- # [111, 111],
1135
- # [111, 112],
1136
- # [115, 129],
1137
- # [115, 131],
1138
- # [122, 123],
1139
- # [122, 129],
1140
- # [122, 131],
1141
- # [125, 125],
1142
- # [125, 126],
1143
- # [128, 129],
1144
- # [131, 131],
1145
- # [138, 139],
1146
- # [143, 143],
1147
- # [144, 148],
1148
- # [144, 154],
1149
- # [150, 154],
1150
- # [156, 156],
1151
- # [158, 162],
1152
- # [158, 169],
1153
- # [161, 162],
1154
- # [164, 166],
1155
- # [164, 169],
1156
- # [168, 169],
1157
- # [171, 172],
1158
- # [174, 175],
1159
- # [174, 184],
1160
- # [177, 178],
1161
- # [180, 181],
1162
- # [183, 184],
1163
- # [186, 188],
1164
- # [186, 189],
1165
- # [186, 196],
1166
- # [191, 192],
1167
- # [195, 196],
1168
- # [198, 215],
1169
- # [203, 215],
1170
- # [204, 205],
1171
- # [209, 212],
1172
- # [209, 215],
1173
- # [214, 214],
1174
- # [214, 215],
1175
- # [218, 220],
1176
- # [221, 221],
1177
- # [223, 223],
1178
- # [224, 225],
1179
- # [224, 228],
1180
- # [227, 228],
1181
- # [231, 232],
1182
- # [238, 252],
1183
- # [241, 246],
1184
- # [241, 252],
1185
- # [248, 250],
1186
- # [248, 252],
1187
- # [255, 255],
1188
- # [255, 263],
1189
- # [258, 260],
1190
- # [262, 262],
1191
- # [262, 263],
1192
- # [266, 274],
1193
- # [273, 274],
1194
- # [276, 278],
1195
- # [280, 288],
1196
- # [283, 288],
1197
- # [292, 292],
1198
- # [293, 295],
1199
- # [297, 297],
1200
- # [310, 312],
1201
- # [315, 337],
1202
- # [315, 338],
1203
- # [320, 320],
1204
- # [328, 329],
1205
- # [340, 345],
1206
- # [343, 345],
1207
- # [347, 348],
1208
- # [349, 349],
1209
- # [350, 356],
1210
- # [352, 352],
- # [352, 356],
- # [354, 355],
- # [358, 362],
- # [365, 370],
- # [366, 367],
- # [369, 370],
- # [375, 375],
- # [377, 378],
- # [377, 379],
- # [381, 387],
- # [389, 389],
- # [389, 400],
- # [391, 391],
- # [398, 400],
- # [402, 402],
- # [403, 409],
- # [407, 409],
- # [416, 416],
- # [419, 421],
- # [419, 425],
- # [427, 432],
- # [431, 432],
- # [434, 435],
- # [437, 437],
- # [441, 443],
- # [445, 445],
- # [450, 454],
- # [456, 460],
- # [462, 464],
- # [470, 485],
- # [482, 482],
- # [484, 485],
- # [490, 490],
- # [492, 493],
- # [495, 496],
- # [501, 504],
- # [510, 511],
- # [513, 514],
- # [517, 517],
- # [517, 518],
- # [518, 518],
- # [520, 522],
- # [524, 524],
- # [527, 528],
- # [527, 534],
- # [530, 531],
- # [537, 537],
- # [538, 538],
- # [539, 548],
- # [543, 548],
- # [550, 564],
- # [553, 558],
- # [553, 564],
- # [554, 555],
- # [554, 556],
- # [560, 561],
- # [563, 564],
- # [565, 565],
- # [566, 567],
- # [572, 575],
- # [574, 574],
- # [574, 575],
- # [577, 579],
- # [582, 582],
- # [585, 585],
- # [587, 589],
- # [591, 591],
- # [593, 602],
- # [606, 606],
- # [608, 608],
- # [610, 611],
- # [613, 614],
- # [616, 618],
- # [620, 621],
- # [623, 625],
- # [627, 627],
- # [629, 629],
- # [631, 632],
- # [634, 635],
- # [634, 651],
- # [637, 637],
- # [639, 640],
- # [642, 644],
- # [642, 645],
- # [642, 651],
- # [647, 648],
- # [647, 675],
- # [650, 651],
- # [653, 655],
- # [653, 657],
- # [659, 672],
- # [674, 675],
- # [677, 678],
- # [677, 705],
- # [680, 685],
- # [680, 693],
- # [680, 705],
- # [687, 688],
- # [687, 693],
- # [687, 716],
- # [690, 691],
- # [690, 693],
- # [690, 716],
- # [693, 693],
- # [695, 696],
- # [695, 716],
- # [698, 699],
- # [701, 702],
- # [701, 705],
- # [704, 705],
- # [707, 708],
- # [707, 712],
- # [710, 712],
- # [714, 716],
- # [714, 738],
- # [718, 719],
- # [718, 722],
- # [718, 734],
- # [718, 738],
- # [721, 721],
- # [721, 722],
- # [724, 725],
- # [727, 728],
- # [730, 731],
- # [733, 734],
- # [736, 738],
- # [748, 748],
- # [752, 754],
- # [752, 757],
- # [759, 759],
- # [766, 767],
- # [766, 779],
- # [766, 785],
- # [769, 770],
- # [772, 773],
- # [772, 779],
- # [772, 785],
- # [772, 801],
- # [775, 776],
- # [775, 779],
- # [775, 785],
- # [775, 791],
- # [775, 801],
- # [775, 803],
- # [778, 779],
- # [778, 801],
- # [778, 806],
- # [781, 782],
- # [781, 785],
- # [781, 791],
- # [781, 801],
- # [781, 806],
- # [781, 807],
- # [781, 809],
- # [781, 810],
- # [784, 785],
- # [784, 801],
- # [784, 807],
- # [784, 810],
- # [787, 788],
- # [787, 789],
- # [787, 791],
- # [787, 800],
- # [787, 801],
- # [787, 803],
- # [787, 806],
- # [787, 809],
- # [787, 810],
- # [787, 816],
- # [790, 791],
- # [790, 801],
- # [790, 806],
- # [790, 807],
- # [790, 809],
- # [790, 816],
- # [793, 794],
- # [793, 801],
- # [793, 816],
- # [796, 797],
- # [796, 801],
- # [796, 809],
- # [796, 810],
- # [796, 822],
- # [799, 800],
- # [802, 802],
- # [802, 803],
- # [802, 806],
- # [802, 816],
- # [802, 823],
- # [802, 831],
- # [805, 806],
- # [805, 831],
- # [808, 809],
- # [808, 831],
- # [808, 832],
- # [811, 813],
- # [811, 816],
- # [811, 822],
- # [811, 825],
- # [811, 831],
- # [811, 832],
- # [811, 837],
- # [811, 838],
- # [815, 816],
- # [818, 819],
- # [821, 822],
- # [823, 823],
- # [824, 825],
- # [824, 828],
- # [824, 831],
- # [824, 838],
- # [824, 852],
- # [827, 828],
- # [830, 831],
- # [833, 834],
- # [833, 837],
- # [833, 843],
- # [833, 849],
- # [833, 852],
- # [836, 837],
- # [839, 840],
- # [839, 843],
- # [839, 852],
- # [842, 843],
- # [845, 846],
- # [845, 849],
- # [848, 849],
- # [851, 852],
- # [854, 871],
- # [854, 881],
- # [856, 858],
- # [860, 864],
- # [860, 870],
- # [865, 865],
- # [866, 866],
- # [869, 870],
- # [872, 872],
- # [872, 900],
- # [875, 875],
- # [877, 878],
- # [880, 880],
- # [880, 881],
- # [883, 884],
- # [886, 886],
- # [888, 890],
- # [892, 893],
- # [895, 895],
- # [897, 913],
- # [897, 917],
- # [898, 917],
- # [902, 917],
- # [903, 917],
- # [905, 907],
- # [905, 910],
- # [912, 912],
- # [914, 917],
- # [915, 916],
- # [918, 919],
- # [923, 933],
- # [923, 940],
- # [925, 925],
- # [925, 933],
- # [925, 940],
- # [935, 935],
- # [937, 940],
- # [942, 943],
- # [942, 949],
- # [947, 948],
- # [951, 954],
- # [951, 957],
- # [956, 957],
- # [959, 959],
- # [959, 965],
- # [963, 964],
- # [967, 970],
- # [967, 974],
- # [972, 974],
- # [976, 982],
- # [977, 977],
- # [977, 978],
- # [977, 980],
- # [979, 979],
- # [979, 980],
- # [984, 987],
- # [989, 990],
- # ],
1497
- # "antecedents": [
1498
- # None,
1499
- # None,
1500
- # None,
1501
- # None,
1502
- # [0, 2],
1503
- # None,
1504
- # None,
1505
- # [13, 13],
1506
- # None,
1507
- # None,
1508
- # None,
1509
- # None,
1510
- # None,
1511
- # [0, 2],
1512
- # None,
1513
- # None,
1514
- # None,
1515
- # None,
1516
- # [13, 13],
1517
- # [0, 2],
1518
- # None,
1519
- # [13, 13],
1520
- # None,
1521
- # None,
1522
- # None,
1523
- # None,
1524
- # None,
1525
- # None,
1526
- # None,
1527
- # None,
1528
- # None,
1529
- # [58, 60],
1530
- # None,
1531
- # None,
1532
- # None,
1533
- # None,
1534
- # None,
1535
- # None,
1536
- # None,
1537
- # None,
1538
- # None,
1539
- # None,
1540
- # None,
1541
- # None,
1542
- # None,
1543
- # [36, 36],
1544
- # [115, 131],
1545
- # None,
1546
- # None,
1547
- # None,
1548
- # None,
1549
- # [144, 148],
1550
- # None,
1551
- # None,
1552
- # None,
1553
- # None,
1554
- # None,
1555
- # None,
1556
- # None,
1557
- # None,
1558
- # None,
1559
- # None,
1560
- # None,
1561
- # None,
1562
- # None,
1563
- # None,
1564
- # None,
1565
- # None,
1566
- # None,
1567
- # None,
1568
- # None,
1569
- # None,
1570
- # None,
1571
- # None,
1572
- # None,
1573
- # None,
1574
- # None,
1575
- # None,
1576
- # None,
1577
- # None,
1578
- # None,
1579
- # None,
1580
- # None,
1581
- # None,
1582
- # None,
1583
- # None,
1584
- # None,
1585
- # None,
1586
- # None,
1587
- # None,
1588
- # None,
1589
- # None,
1590
- # None,
1591
- # None,
1592
- # None,
1593
- # [58, 60],
1594
- # None,
1595
- # None,
1596
- # None,
1597
- # None,
1598
- # [276, 278],
1599
- # [58, 60],
1600
- # None,
1601
- # None,
1602
- # None,
1603
- # None,
1604
- # None,
1605
- # None,
1606
- # [177, 178],
1607
- # None,
1608
- # None,
1609
- # [347, 348],
1610
- # None,
1611
- # [310, 312],
1612
- # None,
1613
- # None,
1614
- # None,
1615
- # [183, 184],
1616
- # None,
1617
- # None,
1618
- # None,
1619
- # None,
1620
- # [381, 387],
1621
- # None,
1622
- # [13, 13],
1623
- # [310, 312],
1624
- # None,
1625
- # None,
1626
- # [310, 312],
1627
- # None,
1628
- # None,
1629
- # None,
1630
- # None,
1631
- # None,
1632
- # None,
1633
- # [434, 435],
1634
- # [310, 312],
1635
- # None,
1636
- # None,
1637
- # None,
1638
- # [310, 312],
1639
- # None,
1640
- # None,
1641
- # None,
1642
- # None,
1643
- # [470, 485],
1644
- # None,
1645
- # None,
1646
- # [407, 409],
1647
- # None,
1648
- # None,
1649
- # None,
1650
- # None,
1651
- # [58, 60],
1652
- # None,
1653
- # None,
1654
- # None,
1655
- # None,
1656
- # [520, 522],
1657
- # None,
1658
- # None,
1659
- # None,
1660
- # None,
1661
- # None,
1662
- # None,
1663
- # None,
1664
- # None,
1665
- # None,
1666
- # None,
1667
- # None,
1668
- # None,
1669
- # None,
1670
- # [520, 522],
1671
- # None,
1672
- # [310, 312],
1673
- # None,
1674
- # [445, 445],
1675
- # [58, 60],
1676
- # None,
1677
- # None,
1678
- # None,
1679
- # None,
1680
- # [510, 511],
1681
- # None,
1682
- # [94, 96],
1683
- # None,
1684
- # None,
1685
- # None,
1686
- # None,
1687
- # None,
1688
- # None,
1689
- # None,
1690
- # None,
1691
- # [347, 348],
1692
- # None,
1693
- # None,
1694
- # None,
1695
- # None,
1696
- # None,
1697
- # [195, 196],
1698
- # None,
1699
- # None,
1700
- # None,
1701
- # None,
1702
- # [224, 225],
1703
- # None,
1704
- # None,
1705
- # None,
1706
- # None,
1707
- # None,
1708
- # None,
1709
- # None,
1710
- # None,
1711
- # None,
1712
- # None,
1713
- # None,
1714
- # None,
1715
- # None,
1716
- # None,
1717
- # None,
1718
- # None,
1719
- # None,
1720
- # None,
1721
- # None,
1722
- # None,
1723
- # None,
1724
- # None,
1725
- # None,
1726
- # None,
1727
- # None,
1728
- # None,
1729
- # None,
1730
- # None,
1731
- # None,
1732
- # None,
1733
- # None,
1734
- # None,
1735
- # None,
1736
- # None,
1737
- # [419, 421],
1738
- # None,
1739
- # [610, 611],
1740
- # None,
1741
- # None,
1742
- # None,
1743
- # None,
1744
- # None,
1745
- # None,
1746
- # None,
1747
- # None,
1748
- # None,
1749
- # None,
1750
- # None,
1751
- # None,
1752
- # None,
1753
- # None,
1754
- # None,
1755
- # None,
1756
- # None,
1757
- # None,
1758
- # None,
1759
- # None,
1760
- # None,
1761
- # None,
1762
- # None,
1763
- # None,
1764
- # None,
1765
- # None,
1766
- # None,
1767
- # None,
1768
- # None,
1769
- # None,
1770
- # None,
1771
- # None,
1772
- # None,
1773
- # None,
1774
- # None,
1775
- # None,
1776
- # None,
1777
- # None,
1778
- # None,
1779
- # None,
1780
- # None,
1781
- # None,
1782
- # None,
1783
- # None,
1784
- # None,
1785
- # None,
1786
- # None,
1787
- # None,
1788
- # None,
1789
- # None,
1790
- # None,
1791
- # None,
1792
- # None,
1793
- # None,
1794
- # None,
1795
- # None,
1796
- # None,
1797
- # None,
1798
- # None,
1799
- # None,
1800
- # None,
1801
- # None,
1802
- # None,
1803
- # None,
1804
- # None,
1805
- # None,
1806
- # None,
1807
- # None,
1808
- # None,
1809
- # None,
1810
- # None,
1811
- # None,
1812
- # None,
1813
- # None,
1814
- # None,
1815
- # None,
1816
- # None,
1817
- # None,
1818
- # None,
1819
- # None,
1820
- # None,
1821
- # None,
1822
- # None,
1823
- # None,
1824
- # None,
1825
- # None,
1826
- # None,
1827
- # None,
1828
- # None,
1829
- # None,
1830
- # None,
1831
- # None,
1832
- # None,
1833
- # None,
1834
- # None,
1835
- # None,
1836
- # None,
1837
- # None,
1838
- # None,
1839
- # None,
1840
- # [310, 312],
1841
- # None,
1842
- # None,
1843
- # None,
1844
- # None,
1845
- # [381, 387],
1846
- # None,
1847
- # None,
1848
- # [13, 13],
1849
- # [869, 870],
1850
- # None,
1851
- # None,
1852
- # None,
1853
- # None,
1854
- # [310, 312],
1855
- # None,
1856
- # None,
1857
- # None,
1858
- # None,
1859
- # None,
1860
- # None,
1861
- # None,
1862
- # [310, 312],
1863
- # None,
1864
- # None,
1865
- # None,
1866
- # None,
1867
- # [38, 39],
1868
- # None,
1869
- # None,
1870
- # [875, 875],
1871
- # None,
1872
- # None,
1873
- # None,
1874
- # [46, 49],
1875
- # None,
1876
- # None,
1877
- # None,
1878
- # None,
1879
- # None,
1880
- # None,
1881
- # None,
1882
- # None,
1883
- # None,
1884
- # None,
1885
- # None,
1886
- # None,
1887
- # None,
1888
- # None,
1889
- # None,
1890
- # None,
1891
- # None,
1892
- # None,
1893
- # None,
1894
- # None,
1895
- # ],
1896
- # }
 
  doc = [
      "The Half Moon is a public house and music venue in Putney, London. It is one of the city's longest running live music venues, and has hosted live music every night since 1963.",
      "The pub is on the south side of the Lower Richmond road, in the London Borough of Wandsworth.",
  ]
+ PARAGRAPH_DELIMITER = "\n\n"
+ doc_string = PARAGRAPH_DELIMITER.join(doc)

  # prepare sample payload
+ payload = doc_string
  prediction = my_handler(payload)

  # show results
  print(prediction)
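
The commented-out sample output removed in this diff lists predicted spans as `[start, end]` pairs, followed by an `"antecedents"` list whose entries are either `None` or another `[start, end]` pair. Assuming the two lists are parallel (entry *i* of `"antecedents"` belongs to span *i*), the sketch below shows one way such output could be grouped into coreference clusters. The function name `build_clusters`, its argument names, and the toy values are illustrative assumptions; they are not part of the handler's API and are not real model output.

```python
from typing import Dict, List, Optional, Sequence, Tuple

Span = Tuple[int, int]


def build_clusters(
    spans: Sequence[Sequence[int]],
    antecedents: Sequence[Optional[Sequence[int]]],
) -> List[List[Span]]:
    """Group spans into clusters by following antecedent links (union-find).

    Assumes `spans` and `antecedents` are parallel lists; spans without any
    link (no antecedent and never used as an antecedent) are left out.
    """
    if len(spans) != len(antecedents):
        raise ValueError("spans and antecedents must have the same length")

    parent: Dict[Span, Span] = {}

    def find(span: Span) -> Span:
        # Register unseen spans as their own root, then walk up to the root.
        parent.setdefault(span, span)
        while parent[span] != span:
            parent[span] = parent[parent[span]]  # path halving
            span = parent[span]
        return span

    for span, antecedent in zip(spans, antecedents):
        if antecedent is None:
            continue
        root_antecedent = find((antecedent[0], antecedent[1]))
        root_span = find((span[0], span[1]))
        if root_antecedent != root_span:
            parent[root_span] = root_antecedent

    groups: Dict[Span, List[Span]] = {}
    for span in list(parent):
        groups.setdefault(find(span), []).append(span)
    # Sort mentions inside each cluster and the clusters themselves.
    return sorted(sorted(members) for members in groups.values())


# Toy values for illustration only (hypothetical, not taken from the model).
toy_spans = [(0, 2), (4, 4), (13, 13), (20, 21), (30, 30)]
toy_antecedents = [None, (0, 2), None, (13, 13), (0, 2)]
print(build_clusters(toy_spans, toy_antecedents))
```

Union-find is used here so that chains of links (a mention pointing to an antecedent that itself points further back) end up in a single cluster regardless of the order in which the pairs are visited.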