IBM Industry Development Environment for Apache Spark
IBM Cloud is charting its ways as a hub to help Data Scientists analyze big data quickly and simply using Apache Spark.
The company is focusing henceforth on its new cloud-based development environment for near
real-time, high performance analytics, giving data scientists the ability to
access and ingest data and deliver insight-driven models to developers.
Available on the IBM Cloud
Bluemix platform, the Data ScienceExperience can provide 250 curated data sets, open source tools and a
collaborative workspace to help data scientists uncover and share meaningful
insights with developers.
One can observe that, IBM created the Data Science Experience with
the goal to extend the speed and agility of Spark to more than two million
members of the R community through new contributions to SparkR, SparkSQL and Apache
SparkML.
For those are unfamiliar, the Data Science Experience’s open and
collaborative environment allows data scientists to accelerate and simplify
data ingestion, curation and analysis by bringing together the content, data,
models, and open source resources from IBM and others including H2O, RStudio,
Jupyter Notebooks on Apache Spark in a single security-rich managed
environment.