SAMPLE DATA: https://docs.google.com/spreadsheets/d/1s6MzBu5lFcc-uUZ9B6CI1YR7P1fDSm4cByFwKt3ckgc/edit?usp=sharing

I have a function that uses textacy to extract source attribution. It automatically returns the speaker, cue, and content of each quotation. Some paragraphs in my dataset contain several quotations, but I only need the first one, which is why I put a break in the for loop. My problem now is that some of the original data