Upgrade & Secure Your Future with DevOps, SRE, DevSecOps, MLOps!
We spend hours on Instagram and YouTube and waste money on coffee and fast food, but wonāt spend 30 minutes a day learning skills to boost our careers.
Master in DevOps, SRE, DevSecOps & MLOps!
Learn from Guru Rajesh Kumar and double your salary in just one year.
Source: siliconangle.com
Google LLC is aiming to make it easier for its cloud customers to deploy and run open-source software projects such as Apache Spark via a new version, released today, of its Cloud Dataproc service running on Kubernetes.
Cloud DataprocĀ is a four-year-old service that allows users take advantage of open-source data tools such as Apache Hadoop and Spark for batch processing, querying, streaming and machine learning tasks.
ItĀ provides open-source data and analytics processing capabilitiesĀ for data engineers and data scientists who need to process information and train models faster at scale. It comes with automation tools that allow clusters to be created quickly, along with the ability to save money by turning clusters off when theyāre not needed.
Kubernetes is a popular open-source software framework thatās used to manage large clusters of containers. Containers in turn are used to host the components of modern applications that can run on any infrastructure platform.
By combining Cloud Dataproc with Kubernetes, Google is enabling data scientists to unify resource management, isolate jobs and build resilient infrastructures across any environment, the company said in an announcement. Their open-source workloads also become much more portable.
āThe overall idea Google has with its cloud services is to combine the best of Google Cloud and open source,ā James Malone, Googleās product manager for managed services on open source software, told SiliconANGLE in an interview.
Malone explained that many customers face challenges in running open source software as it requires significant expertise, not just with the bewildering array of components itās made of, but with the entire ecosystem.
āThe open-source stack is very complicated,ā Malone said. āDataproc is the first managed service to take these open-source components and make them work on Kubernetes.ā
Open-source jobs therefore become much simpler on Cloud Dataproc on Kubernetes. The service does away with the need to work with two separate cluster management interfaces to manage open source components, for example.
āUsing Dataprocās new capabilities, Google will give you one central view that can span both cluster management systems,ā Google explained in its pitch. āSupporting both YARN and Kubernetes will give enterprises the flexibility they need to modernize certain hybrid workloads while continuing to monitor YARN-based workloads.ā
The other main benefit is that users can containerize and isolate open-source software jobs on Kubernetes. This means their machine learning models and extract, transact and load pipelines can be moved from development to production without any compatibility problems. It also means customers can stop worrying about being locked in to a single environment.
āMoving to Kubernetes prevents lock-in,ā Malone said. āSo [customers] take jobs and run them on Amazon Elastic MapReduce for example. Itās easier to do with Kubernetes because containers are highly portable.ā
Moreover, Cloud Dataproc on Kubernetes provides what Google calls a āself-healing environmentā where infrastructure management tasks such as sizing and building clusters, manipulating Docker files and network configuration are all automated.
Cloud Dataproc on Kubernetes is currently available as an early āalphaā preview. At present it only works with Apache Spark but Google is planning to add more open source software projects, including Apache Flink, in the future.
Since youāre here ā¦
ā¦ Weād like to tell you about our mission and how you can help us fulfill it. SiliconANGLE Media Inc.ās business model is based on the intrinsic value of the content, not advertising. Unlike many online publications, we donāt have a paywall or run banner advertising, because we want to keep our journalism open, without influence or the need to chase traffic.The journalism, reporting and commentary onĀ SiliconANGLEĀ ā along with live, unscripted video from our Silicon Valley studio and globe-trotting video teams atĀ theCUBEĀ ā take a lot of hard work, time and money. Keeping the quality high requires the support of sponsors who are aligned with our vision of ad-free journalism content.
If you like the reporting, video interviews and other ad-free content here,Ā please take a moment to check out a sample of the video content supported by our sponsors,Ā tweet your support, and keep coming back toĀ SiliconANGLE.