A sense of frustration has developed among some companies wanting to develop a Big Data Strategy using Data Science and aligned technologies. They can’t find people capable of doing the work. Experts have been warning of a Data Scientist shortage for at least six years, but it’s a strange sounding, “new” career field that many […]
Data Science Education: Massachusetts Institute of Technology
MIT’s Institute for Data, Systems, and Society (IDSS) is focused on addressing a broad range of challenges by promoting research and education in Data Science education, statistics, decision making, and the social sciences. It was launched simultaneously, in the Fall of 2016, with MIT’s new Minor in Statistics and Data Science, and is available to […]
Big Data Applications: Managing Complexity with Success
In essence, Unravel Data makes processing Big Data easier. The program was designed to resolve the complicated and disconcerting problems that emerge when processing Big Data. These applications can become confusing and difficult to operate. Never-before-seen challenges arise with chronic regularity, leaving research teams constantly struggling with issues such as allocating resources, scheduling, and debugging. […]
Microservices 101: Changing the Monolithic Data Architecture Model
In May 2011, a group of software architects at a workshop near Venice used the word “microservice” to describe a process they had recently worked on. The philosophical essence of a microservice can be described as, “Focus on one task and do it well.” Microservices are a flexible, single-purpose software program essentially compatible with any […]
Data Science Education: Stanford University’s Program
The Stanford Data Science education program is small with competitive acceptance requirements. It is administrated by their Statistics Department. Guenther Walther, Chairman of the Statistics Department, was asked what students can do to get in the program. He responded: “The admissions process is competitive, indeed. We had close to 400 applicants for the eight spots […]
SQL versus NoSQL Databases: A Review of Key Themes
Non-relational databases (aka NoSQL), in many different variations, have become a popular database model for handling Big Data. They use two novel concepts, making them different from earlier, “classical” database models. The concepts having so much impact are horizontal scaling, spreading out the storage and work, and eliminating the use of “Structured Query Language” (SQL) […]
Data Science Education and Research at The University of Michigan
The University of Michigan’s Data Science education program provides a wide range of training options, which can be tailored to the individual goals of the student. They offer an individualized, interactive, data-centered training experience. The U of M Graduate Data Science Certificate Program’s primary mission is to develop a corps of skilled Data Scientists with […]
A Review of Different Database Types: Relational versus Non-Relational
Relational databases are also called Relational Database Management Systems (RDBMS) or SQL databases. Historically, the most popular of these have been Microsoft SQL Server, Oracle Database, MySQL, and IBM DB2. The RDBMS’s are used mostly in large enterprise scenarios, with the exception of MySQL, which is also used to store data for Web applications. All […]
Cloud Computing: Latest Trends, Issues, and Innovations
Cloud Computing is an essential part of many enterprises’ Data Architecture; and, not just in new enterprises, but throughout multiple industries and in companies of all sizes. Gatepoint Research has done a survey showing why. IT pros and business executives both like the advantages of Cloud storage. Access to data, after a disaster, came in […]
Big Data Processing 101: The What, Why, and How
First came Apache Lucene, which was, and still is, a free, full-text, downloadable search library. It can be used to analyze normal text for the purpose of developing an index. The index maps each term, “remembering” its location. When the term is searched for, Lucene immediately knows all the places where that term had existed. […]