score>1000
for 2013 and 2014 (~5000)
sqlite3
, requests
, bs4
, pandas
, numpy
, scikit-learn
, gensim
, praw
, wikipedia
, nltk
, stemmming.porter2
>> "Good muffins cost $3.88\n in New York"
['Good', 'muffins', 'cost', 'TOKEN_MONEY', 'in', 'New', 'York', 'TOKEN_EOS']
>> "I sat on the rock"
['I', 'sat', 'on', 'rock']
>> stem("factionally")
'faction'
TF-IDF
(term frequency-inverse document frequency)
TF-IDF
>>> model.most_similar(positive=['woman', 'king'], negative=['man'])
[('queen', 0.50882536), ...]
>>> model.doesnt_match("breakfast cereal dinner lunch".split())
'cereal'
>>> model.similarity('woman', 'man')
0.73723527
>>> model['computer'] # raw numpy vector of a word
array([-0.00449447, -0.00310097, 0.02421786, ...], dtype=float32)
Uses far fewer features to store relationships between words!
Training classifier
Test Accuracy: 0.878; Test Accuracy on TP: 0.116; Test Accuracy on TN: 0.998
"Bubble wrap" is a generic trademark owned by Sealed Air Corporation. In 1957 two inventors named Alfred Fielding and Marc Chavannes were attempting to create a three-dimensional plastic wallpaper. Although the idea was a failure, they found that what they did make could be used as packing material. Sealed Air Corp. was co-founded by Alfred Fielding.
Julia Stewart, who originally worked as a waitress at IHOP and worked her way up through the restaurant industry, became Chief Executive Officer of IHOP Corporation. She had previously been President of Applebee’s, but left after being overlooked for that company's CEO position. She became CEO of IHOP in 2001, and returned to manage her old company due to the acquisition.
Martin is opposed to fan fiction, believing it to be copyright infringement and a bad exercise for aspiring writers.