Palo Alto, CA, USA
Sep 13, 2021   |  By Viktor Somogyi-Vass
There are two big gaps in the Apache Kafka project when it comes to operating a cluster. The first is monitoring the cluster efficiently, and the second is managing failures and changes in the cluster. The Kafka project itself offers no solution for either, but there are many good third-party tools for both problems. Cruise Control is one of the earliest open source tools to address the failure management problem, and more recently it has addressed the monitoring problem as well.
Sep 10, 2021   |  By Jonathan Hsieh
Shared Data Experience (SDX) on Cloudera Data Platform (CDP) enables centralized data access control and audit for workloads in the Enterprise Data Cloud. The public cloud (CDP-PC) editions default to using cloud storage (S3 for AWS, ADLS-gen2 for Azure). This introduces new challenges around managing data access across teams and individual users. To solve these challenges for S3 and ADLS-gen2, Cloudera has introduced a new service — the Ranger Authorization Service (RAZ).
Sep 9, 2021   |  By Andreas Skouloudis
The CDP Operational Database (COD) builds on the foundation of existing operational database capabilities that were available with Apache HBase and/or Apache Phoenix in legacy CDH and HDP deployments.
Sep 8, 2021   |  By Daniel Hand
In recent years there has been increased interest in how to safely and efficiently extend enterprise data platforms and workloads into the cloud. CDOs are under increasing pressure to reduce costs by moving data and workloads to the cloud, similar to what has happened with business applications during the last decade. Our upcoming webinar is centered on how an integrated data platform supports the data strategy and goals of becoming a data-driven company.
Sep 2, 2021   |  By Marton Mayer
The shift to cloud has been accelerating, and with it, a push to modernize the data pipelines that fuel key applications. That is why cloud-native solutions that take advantage of capabilities such as disaggregated storage and compute, elasticity, and containerization are more important than ever. At Cloudera, we introduced Cloudera Data Engineering (CDE) as part of our Enterprise Data Cloud product, Cloudera Data Platform (CDP), to meet these challenges.
Sep 1, 2021   |  By Cloudera Contributors
The more an enterprise wants to know about itself and its business prospects, the more data it needs to collect and analyze. Additionally, the more data it collects and stores, the better its ability to know customers, to find new ones, and to provide more of what they want to buy. Sounds simple, but a surprising majority of U.S.
Aug 31, 2021   |  By Vinay Rayker
Cloudera and Accenture demonstrate strength in their relationship with an accelerator called the Smart Data Transition Toolkit for migration of legacy data warehouses into Cloudera Data Platform.
Aug 26, 2021   |  By George Huang
Apache Ozone is a scalable distributed object store that can efficiently manage billions of small and large files. Ozone natively provides Amazon S3 and Hadoop Filesystem compatible endpoints in addition to its own native object store API endpoint and is designed to work seamlessly with enterprise scale data warehousing, machine learning and streaming workloads. The object store is readily available alongside HDFS in CDP (Cloudera Data Platform) Private Cloud Base 7.1.3+.
Aug 25, 2021   |  By Cloudera Contributors
COVID-19 vaccines were developed in record time. One of the main reasons for the accelerated development was the quick exchange of data between academia, healthcare institutions, government agencies, and nonprofit entities. “COVID research is a great example of where sharing data and having large quantities of data to analyze would be beneficial to us all,” said Renee Dvir, solutions engineering manager at Cloudera.
Aug 21, 2021   |  By Mark Micallef
There is an urgent need for banks to be nimble and adaptable amid a multitude of industry challenges, ranging from the maze of regulatory compliance and sophisticated criminal activities to rising customer expectations and competition from traditional banks and new digital entrants. As banks find their bearings in this landscape, what appear to be insurmountable odds are in fact opportunities for growth and competitive differentiation.
Sep 10, 2021   |  By Cloudera
SQL Stream Builder, part of Cloudera Streaming Analytics, allows developers, analysts, and data scientists to write streaming applications using industry-standard SQL. It provides an interactive experience, so the development process is quick, easy, and productive while taking advantage of Apache Flink’s streaming power. It provides an advanced materialized view engine to interface with applications, tooling, and services via REST API.
Sep 8, 2021   |  By Cloudera
Join us live to hear about the latest features we've released in CDP Private Cloud. This session will discuss new features in CDP Private Cloud Base 7.1.7 and four paths to upgrade to CDP. Don't miss your chance for live Q&A!
Aug 27, 2021   |  By Cloudera
A demo video by Vinay Rayker
Aug 26, 2021   |  By Cloudera
Join us LIVE to discuss what’s new in CDP Public Cloud! Don’t miss the live Q&A as we learn about Natural Language Processing on Data Viz and the recently released Cloudera DataFlow on Public Cloud.
Aug 16, 2021   |  By Cloudera
Cloudera DataFlow for the Public Cloud takes away the operational and monitoring challenges by providing cloud-native flow management capabilities powered by Apache NiFi. It is a purpose-built framework that modernizes the data flow user experience so that NiFi developers and administrators can easily handle sophisticated data flows in production.
Aug 6, 2021   |  By Cloudera
In today’s video, we’re going to take a workload that trains an image classification neural net and show how you can use Cloudera Machine Learning to leverage NVIDIA GPUs to achieve impressive speed improvements without any substantial code changes.
Aug 5, 2021   |  By Cloudera
NVIDIA and Cloudera have partnered to bring NVIDIA GPU acceleration to the Cloudera Data Platform (CDP). In this quick demo, you can see how using GPUs can cut the time of an example ETL Spark job from over two hours to just a few minutes.
Aug 3, 2021   |  By Cloudera
Cloudera Operational Database (COD) is an operational database as a service that brings ease of use and flexibility. Let’s see how easy it is to create a new database! Once you have created your environment, navigate to the COD Web interface. It takes you to the Databases page. Click Create Database, select the applicable environment, provide a name for your database and click Create Database. The creation of your new database is in progress. Once its status becomes Available it is ready to be used.
Jul 22, 2021   |  By Cloudera
This video describes a way to hide the plain-text passwords in a Knox topology. These are typically the Bind User passwords used with ShiroProvider and LDAP/AD authentication.
Jul 15, 2021   |  By Cloudera
Join us to talk about Replication Manager, RBAC, and Custom Images - all things to make your life easier. Don't miss the live Q&A during the session!
Jun 28, 2018   |  By Cloudera
Enterprises require fast, cost-efficient solutions to the familiar challenges of engaging customers, reducing risk, and improving operational excellence to stay competitive. The cloud is playing a key role in accelerating time to benefit from new insights. Managed cloud services that automate provisioning, operation, and patching will be critical for enterprises to leverage the full promise of the cloud when it comes to time to value and agility.
Jun 26, 2018   |  By Cloudera
The adoption of cloud computing in the financial services sector has grown substantially in the past three years on a global basis. Diversification of risk is always a key concern for financial institutions and the seeming safety of having a single cloud provider is not being properly measured from a systemic risk and operational risk perspective.
Jun 12, 2018   |  By Cloudera
This white paper provides a reference architecture for running Enterprise Data Hub on Oracle Cloud Infrastructure. Topics include installation automation, automated configuration and tuning, and best practices for deployment and topology to support security and high availability.
May 17, 2018   |  By Cloudera
A cloud-based analytics platform needs to be easy, unified, and enterprise-grade to meet the demands of your business. This white paper covers how Cloudera's machine learning and analytics platform complements popular cloud services like Amazon Web Services (AWS) and Microsoft Azure, and enables customers to organize, process, analyze, and store data at large scale...anywhere.
May 15, 2018   |  By Cloudera
The Modern Platform for Machine Learning and Analytics Optimized for Cloud.
Mar 25, 2018   |  By Cloudera
In the wake of the global financial crisis, the world has become much more interconnected and immensely more complex. As a result, you can no longer simply look at the past as an indicator of future trends. The financial services industry needs real-time insights into numerous interacting variables to make informed decisions.

Cloudera delivers the modern platform for machine learning and analytics optimized for the cloud. Imagine having access to all your data in one platform. The opportunities are endless. We enable you to transform vast amounts of complex data into clear and actionable insights to enhance your business and exceed your expectations.

The right products for the job:

  • Enterprise Data Hub: Operate with confidence—thanks to comprehensive security and governance—while at the same time enabling unrivaled self-service performance at extreme scale. All in an enterprise-grade solution that lets you run anywhere, on-premises or in hybrid- and multi-cloud environments.
  • Data Science Workbench: Accelerate machine learning from research to production with a secure, self-service data science platform built for the enterprise.
  • Data Warehouse: A modern data warehouse that delivers an enterprise-grade, hybrid cloud solution designed for self-service analytics.
  • Data Science & Engineering: Cloudera Data Science provides better access to Apache Hadoop data with familiar and performant tools that address all aspects of modern predictive analytics.
  • Altus Cloud: The industry’s first machine learning and analytics cloud platform built with a shared data experience.

The world’s leading organizations choose Cloudera to grow their businesses, improve lives, and advance human achievement.