Click to learn more about author Aditi Raiter. Self-service BI is lucrative but demands an eye on governance and security. In this article, I will stress the behaviors and strategies needed around managing self-serve BI/reporting tools. Self-service analytics rotates around business users having access to data preparation and dashboard building tools. There are many challenges […]
Rebound from Recession with the Power of an AI Strategy
Click here to learn more about Dr. Tommy Weir. Among the many tough questions that this latest — and potentially greatest — recession is throwing at businesses, is what their AI strategy should look like, both in order to navigate the downturn and to bounce back when it’s all over. Needless to say, there is […]
Guided Labeling Episode 4: From Exploration to Exploitation
Click to learn more about author Paolo Tamagnini. One of the key challenges in using supervised machine learning for real world use cases is that most algorithms and models require a sample of data that is large enough to represent the actual reality your model needs to learn. These data need to be labeled. These […]
What is the KMeans Clustering Algorithm and How is it Used to Analyze Data?
Click to learn more about author Kartik Patel. This article provides a brief explanation of the KMeans Clustering algorithm. What is the KMeans Clustering algorithm? The KMeans Clustering algorithm is a process by which objects are classified into number of groups so that they are as much dissimilar as possible from one group to another, and […]
Mise en Place for Data Science
Click to learn more about author Curt Bergmann. When guests arrive at a great restaurant, the chef and all the cooks have already planned and assembled everything they need to quickly deliver excellence on a plate. Their process, called mise en place, is used by chefs all over the world. Emerging after the introduction of […]
How to Transform into a Data-Driven Organization?
Click to learn more about author Konain Qurban. It is a journey to ensure the alignment of analytics initiatives to organizational objectives, combined with consistent and effective coordination of activities across all business units. The road from a pile of raw data to insights and from insights to action is paved with strategic goals. More […]
K-Anonymization: An Introduction for First Graders
Click to learn more about author John Murray. Now that privacy-enhancing technologies (PETs) have become a subject of dinner table conversations, our research team continues to field questions on these complex topics which can be difficult to explain. As part of this series, I will attempt to explain another PET, K-anonymization, in a first-grade context: […]
Cloud Data Warehouse vs. Cloud Data Lake: Overall Resource Efficiency and Productivity
Click here to learn more about Jason Nadeau. Since the COVID-19 crisis began, IT budgets have become tighter, driving technology leaders to figure out ways to do more with less. Data-driven enterprises cannot simply afford to sunset data modernization or data analytics projects. In times like this, several industry verticals (insurance, financial services, and healthcare) […]
How to Handle Semi-Structured Data: JSON Functions
Click to learn more about author Eva Murray. The world has seen an explosion in data with an incredible amount of data being produced every single day (2.5 quintillion bytes, an almost incomprehensible number). Much of this data is semi-structured or unstructured data, stemming from the content produced on social media platforms in the form […]
Guided Labeling Episode 3: Model Uncertainty
Click to learn more about author Paolo Tamagnini. In this series, we’ve been exploring the topic of guided labeling by looking at active learning and label density. In the first episode, we introduced the topic of active learning and active learning sampling and moved on to look at label density in the second article. Here […]