News

Apache Flink, a distributed in-memory data processing framework born out of Germany, this week graduated from the Apache Incubator and became a Top-Level Project at the open source software ...
Today, Samsung Next, the venture capital arm of Samsung Electronics, announced it is making a strategic investment in Seattle-based Expanso, a startup looking to power distributed data processing ...
Written in Java, Hive is a specialized execution front end for Hadoop. Hive lets you write data queries in an SQL-like language. You're using Hadoop, but it feels like you're talking SQL to a ...
If you haven’t heard of Flink until now, get ready for the deluge. As one of a stream of Apache incubator-to-top-level projects turned commercial effort, the data processing engine’s promise is to ...
Even when designing a Minimum Viable Architecture (MVA), developers must consider resource location, especially when mobile apps are part of a distributed system. Distributing the data and ...
It is time to stop the stampede to create capacity to analyze big data and instead pursue a more balanced approach that focuses on finding more data sets and understanding how to use them to ...
Data today is too distributed and too large to move to where the processing is performed. Now, businesses must find ways to move the processing close to the data, and the closer, the better.
Still, Hive is an ideal express-entry into the large-scale distributed data processing world of Hadoop. All the ease of SQL with all the power of Hadoop — sounds good to me.
Even though new technologies are appearing all the time, the aggregate direction of travel of the past decade has been clear: away from local, distributed data processing and toward cloud storage ...
Expanso secures $7.5M for its open-source software Bacalhau, advancing enterprise edge data processing, backed by General Catalyst & Hetz Ventures.