Click to learn more about author Paolo Tamagnini. In this series, we’ve been exploring the topic of guided labeling by looking at active learning and label density. In the first episode, we introduced the topic of active learning and active learning sampling and moved on to look at label density in the second article. Here […]
Location Analytics: Improving User Acquisition, Conversion, and Retention
Click to learn more about author Aditi Sinha. “Location” is critical for app-based delivery, logistics, and mobility companies. Who is this for? An app-based delivery company delivers whatever users order through their app right at the user’s location, anytime. On-demand companies, such as food delivery companies, supply chain companies like e-commerce businesses, hyperlocal delivery companies, […]
What Is a Data Engineer?
Data engineers build Data Architecture through infrastructures and foundations. A data engineer is tasked with designing and maintaining the architecture of data systems, which incorporates concepts ranging from analytic frameworks to data warehouses. Responsibilities also include configuring, managing, and scaling data pipelines. Data engineers: Have a programming background (e.g. Java, Scala or Python) Emphasize distributed […]
Enterprise Data World 2021 – San Diego, CA
DATE: April 18-23, 2021 LOCATION: San Diego, California VENUE: Sheraton San Diego 1380 Harbor Island Drive San Diego, California 92101 ABOUT THE EVENT: The 25th Annual Enterprise Data World (EDW) Conference hosted by DATAVERSITY® is recognized as the most comprehensive educational conference on Data Management in the world. Attend classroom-sized workshops, in-depth tutorials, case study sessions, and […]
Guided Labeling Episode 2: Label Density
Click to learn more about author Paolo Tamagnini. The Guided Labeling series of blog posts began by looking at when labeling is needed — i.e., in the field of machine learning when most algorithms and models require huge amounts of data with quite a few specific requirements. These large masses of data need to be […]
No More Iris: Three Ways to Become a Better Data Scientist
Click here to learn more about Ted Kwartler. Each semester I am tempted to teach my Harvard Extension students using standard data sets like Iris or Titanic. It would be easy with so many examples floating around. Plus, the explanation of a K-Nearest Neighbor algorithm fits perfectly into an Iris world. But honestly, real-world data […]
AI Strategy: Using the 715 Framework to Build High Value Big Data
Click to learn more about author Asha Saxena. As I predicted that the virtual chief AI officer will be a trend in my last article. AI has become critical as we enter Industry 4.0. More than ever businesses need to pivot with the most massive shift towards the digital economy that has accelerated by the force […]
Guided Labeling Episode 1: An Introduction to Active Learning
Click to learn more about author Paolo Tamagnini. One of the key challenges of utilizing supervised machine learning for real-world use cases is that most algorithms and models require lots of data with quite a few specific requirements. First of all, you need to have a sample of data that is large enough to represent […]
The Right Data is the New Black, and How Leading Brands Obtain It
Click to learn more about author Julius Cerniauskas. The digital universe is expanding fast. According to a SINTEF study conducted back in 2013, 90 percent of all the world’s data was generated from 2011 to 2012. By 2020, the number of data bits created globally was expected to reach 44 zettabytes, as predicted by the […]
JS Show It! Interactive Choropleth World Map Using GeoChart Visualization
Click to learn more about co-author Rosaria Silipo. Click to learn more about co-author Paolo Tamagnini. The Plot Today we want to draw the choropleth map as shown above. So, what do we need? A map of the countries of the world and the corresponding numbers of their populations A short JavaScript code to load […]