The Risk

Which platform is more efficient than Apache Spark or Hadoop?

Submitted by karine » Tue 16-Jul-2019, 17:47

Subject Area: General

5 member ratings

Which data analytics platform is more efficient than Apache Spark or Hadoop? Is it possible to compare the pros and cons of these tools?


9 Comments 

Member Comments

RE: Which platform is more efficient than Apache Spark or Hadoop?

Spark

By mh6562086 » Thu 30-Jan-2020, 10:14, My rating: ✭ ✭ ✭ ✭ ✩

Apache Crunch is essentially MapReduce on steroids. It can run locally, on YARN, and on Spark.
It is close to Spark in philosophy and somewhat similar in API, but it has no SQL support (and no query optimization). It does support Avro and Parquet, understands schemas, and has sources and targets (sinks), for example for HBase. We at Innovecs no longer use it.
Not a bad tool overall, but I'm afraid it is dead. One commit every few months is a telling symptom.

