codesnippets
To share, link to, and discuss bits of code that actually do DH-y things, or at least try to. And to ask questions along the lines of, "What's the best script-y way to <fill in short short task here>."
2015-10-23
2015-10-24
2015-10-25
2015-10-26
2015-10-27
2015-10-28
data:image/s3,"s3://crabby-images/0d15d/0d15db04c16340aead27a60ff071ad0d91afd018" alt=""
Hey, soupy people, can I ask a Beautiful Soup question? What would you use to grab the paragraph beginning with “Cast:” on a page like this? I can’t figure out how to grab content by using the thing it starts with and I don’t seem to be Googling the right thing. http://silentera.com/PSFL/data/S/SymbolOfTheUnconquered1920.html
data:image/s3,"s3://crabby-images/bf247/bf247595129005074e32fa41e7cfcd28861dd1f0" alt=""
@miriamposner: it’s been a while since i’ve used Beautiful Soup and I’m sure there is a better way, but here is a quick and dirty way for a one-off
>>> soup = BeautifulSoup(html_doc, 'html.parser')
>>> paragraphs = soup.find_all('p')
>>> for p in paragraphs:
... if "Cast" in str(p):
... print p
...
<p>Cast: Iris Hall [Evon Mason], Walker Thompson [Hugh Van Allen], Lawrence Chenault, Jim Burris, Mattie Wilkes, E.G. Tatum, Leigh Whipper, James Burrough, George Catlin</p>
data:image/s3,"s3://crabby-images/0d15d/0d15db04c16340aead27a60ff071ad0d91afd018" alt=""
@jay.varner: Yes! That’s exactly what I needed. Thanks so much! I really appreciate it.
data:image/s3,"s3://crabby-images/bf247/bf247595129005074e32fa41e7cfcd28861dd1f0" alt=""
glad i could help
2015-10-29
data:image/s3,"s3://crabby-images/319e2/319e2dc1cd42123aba2a85eba36c67067f01edf7" alt=""
data:image/s3,"s3://crabby-images/319e2/319e2dc1cd42123aba2a85eba36c67067f01edf7" alt=""
@sfsheath: testing github/gist integration. A little wonky but seems work ok.
data:image/s3,"s3://crabby-images/32e26/32e265c5ddeb0f7accecf768d3f28449185942bb" alt=""
Wonky is good.
2015-11-02
data:image/s3,"s3://crabby-images/883ad/883ad8520e2f7408c1307dae1449a8450d14876e" alt=""
data:image/s3,"s3://crabby-images/883ad/883ad8520e2f7408c1307dae1449a8450d14876e" alt=""
2015-11-03
2015-11-04
2015-11-05
2015-11-06
2015-11-10
2015-11-12
data:image/s3,"s3://crabby-images/883ad/883ad8520e2f7408c1307dae1449a8450d14876e" alt=""
has anyone ever written A Big Book Of Escaping Rules? Because I’d buy it.
data:image/s3,"s3://crabby-images/eae53/eae535a10d9c1ce1a02eb43a40033e5b21baf57f" alt=""
@mdlincoln: For strings? Doesn’t it vary from language to language?
data:image/s3,"s3://crabby-images/883ad/883ad8520e2f7408c1307dae1449a8450d14876e" alt=""
“Doesn’t it vary from language to language?” precisely
data:image/s3,"s3://crabby-images/eae53/eae535a10d9c1ce1a02eb43a40033e5b21baf57f" alt=""
data:image/s3,"s3://crabby-images/eae53/eae535a10d9c1ce1a02eb43a40033e5b21baf57f" alt=""
though really, what are you looking for from that kind of document?
data:image/s3,"s3://crabby-images/eae53/eae535a10d9c1ce1a02eb43a40033e5b21baf57f" alt=""
mappings between escaping rules in various languages?
data:image/s3,"s3://crabby-images/883ad/883ad8520e2f7408c1307dae1449a8450d14876e" alt=""
At the moment: trying to interleave a perl regex substitution within a makefile
data:image/s3,"s3://crabby-images/883ad/883ad8520e2f7408c1307dae1449a8450d14876e" alt=""
I’ve fixed it (turns out you need to use double $$ when calling perl from make
data:image/s3,"s3://crabby-images/883ad/883ad8520e2f7408c1307dae1449a8450d14876e" alt=""
just idly wondering how one might design some tool that helps map out escaping issues when calling different languages
data:image/s3,"s3://crabby-images/eae53/eae535a10d9c1ce1a02eb43a40033e5b21baf57f" alt=""
here’s one that’s got a few languages: http://www.codecodex.com/wiki/Escape_sequences_and_escape_characters
data:image/s3,"s3://crabby-images/eae53/eae535a10d9c1ce1a02eb43a40033e5b21baf57f" alt=""
doesn’t really seem that much use though, and I think it’s less about escaping rules and more about string handling quirks across languages
2015-11-13
data:image/s3,"s3://crabby-images/61193/611939818b1b8601fda0b8922a6f1d3283b1f21e" alt=""
kind of unrelated but I found this infinitely useful when transitioning from java to python or when I’m just super lazy http://txt2re.com