The data science landscape has changed considerably in the last few months. The rise of AI tools and user-friendly data platforms has democratized access to data-driven insights, giving even non-experts access to an essential toolkit that enhances efficiency and decision-making. As data science becomes more integral to business operations, employers expect data literacy from more […]
Why Geospatial Data Should Be Easily Accessible for Every Employee
Unlocking the power of geospatial data can give organizations a competitive edge, from optimizing supply chain logistics and enhancing customer experience to mitigating fraud and improving public health outcomes. But despite its far-reaching benefits, many organizations fail to fully harness geospatial data’s potential. Why? Because geospatial data is voluminous, complex, and often distributed across multiple […]
Ten Common Issues When Using Excel for Data Operations
Click to learn more about co-author Rosaria Silipo. Using Excel to Transform/Analyze Your Data? I know you are still using Excel sheets to transform and/or analyze your data! I know, because most of us still use it to some extent. There is nothing wrong with using Excel. Excel spreadsheets are a great tool to collect […]
JS Show It! Interactive Choropleth World Map Using GeoChart Visualization
Click to learn more about co-author Rosaria Silipo. Click to learn more about co-author Paolo Tamagnini. The Plot Today we want to draw the choropleth map as shown above. So, what do we need? A map of the countries of the world and the corresponding numbers of their populations A short JavaScript code to load […]
Data Chef ETL Battles: WebLog Data for Clickstream Analysis
Click to learn more about co-author Maarit Widmann. Click to learn more about co-author Anna Martin. Click to learn more about co-author Rosaria Silipo. Do you remember the Iron Chef battles? It was a televised series of cook-offs in which famous chefs rolled up their sleeves to compete in making the perfect dish. Based on a […]
Ensemble Models: Bagging and Boosting
Click to learn more about author Rosaria Silipo. Ensemble models combine multiple learning algorithms to improve the predictive performance of each algorithm alone. There are two main strategies to ensemble models — bagging and boosting — and many examples of predefined ensemble algorithms. Bootstrap aggregation, or bagging, is an ensemble meta-learning technique that trains many […]
Deploying the Obscure Python Script: Neuro-Styling of Portrait Pictures
Click to learn more about author Rosaria Silipo. Click to learn more about co-author Mykhailo Lisovyi. Today’s Style: Caravaggio or Picasso? While surfing on the internet a few months ago, we came across this study[i], promising to train a neural network to alter any image according to your preferred painter’s style. These kinds of studies […]
From a Single Decision Tree to a Random Forest
Click to learn more about author Rosaria Silipo. The co-author of this column was Kathrin Melcher. Decision trees are a set of very popular supervised classification algorithms. They are very popular for a few reasons: They perform quite well on classification problems, the decisional path is relatively easy to interpret, and the algorithm to build […]
The 80/20 Challenge: From Classic to Innovative Data Science Projects
Click to learn more about author Rosaria Silipo. Sometimes when you talk to data scientists, you get this vibe as if you’re talking to priests of an ancient religion. Obscure formulas, complex algorithms, a slang for the initiated, and on top of that, some new required script. If you get these vibes for all projects, […]
Automated Machine Learning: Just How Much?
Click to learn more about author Rosaria Silipo. An interview with three data scientists and guided automation experts There is currently a lot of talk about automated machine learning. There is also a high level of skepticism. I am here with data scientists Paolo Tamagnini, Simon Schmid and Christian Dietz.. to ask a few questions […]