Frequencies in R — Part 2

Click here to learn more about author Steve Miller. In last month’s blog, I compared several functions that compute frequencies and crosstabs in R. The ones I’ve worked with primarily, and the foci of Part 1, were table from the base package, xtabs from the stats package, and count from Hadley Wickham’s plyr package. Tests were conducted on a data set […]

Frequencies in R — Part 1

Click here to learn more about author Steve Miller. I’m often asked to name the most common statistical procedure used in my company’s Data Science work. My answer, only partly in jest, is frequencies and crosstabs, which help with the mundane tasks of profiling and exploring data. Indeed, frequency distributions and the dotplots that showcase […]

New Jobs Analysis with Python

Click here to learn more about author Steve Miller. The presidential race is heating up as primaries come to an end. And if it’s Trump vs Clinton, there’ll be no shortage of strong opinion among the electorate as to which offers the best policies for economics, defense, energy, health care, etc. Last year I posted […]

Web Scraping for Data Science — Part 2

Click here to learn more about author Steve Miller. Read Part 1 of this blog series here. Between R and Python, analytics pros are covered on most data science bases. In last month’s blog, I discussed simple web scraping using Python in a Jupyter notebook, the nifty CSS-generating tool SelectorGadget, and the Python XML and HTML handling package lxml. […]