Welcome!

Apache Authors: Gilad Parann-Nissany, ChandraShekar Dattatreya, Michael Meiner, MC Brown, Amy Lindberg

News Feed Item

Cloudera Qualifies Data Scientists With New Certification Program

Hands-On Certification Prepares Data Scientists for Success With Real-World Data; Data Science Challenge Begins March 31

PALO ALTO, CA -- (Marketwired) -- 03/26/14 -- Cloudera, the leader in enterprise analytic data management powered by Apache Hadoop™, today announced the industry's first hands-on data science certification, called Cloudera Certified Professional: Data Scientist (CCP: DS). Comprised of a Data Science Essentials exam, a twice-annual Data Science Challenge, and several preparatory and enablement resources, Cloudera's data scientist certification program helps developers, analysts, statisticians, and engineers get experience with relevant big data tools and techniques and validate their abilities while helping prospective employers identify elite, highly skilled data scientists. The next Cloudera Data Science Challenge begins March 31, 2014.

Industry Faces Shortage of Qualified Data Scientists

Enterprises are increasingly storing massive amounts of data in Hadoop to streamline the path to actionable insights, develop advanced analytics models, and build big data tools that were previously unattainable for most organizations. As a result, the demand for data scientists is at an all-time high. Data scientists possess a rare combination of engineering capabilities, statistical skills, and subject matter expertise that is difficult to find. Job openings for data scientists far outpace the limited supply of these highly in-demand workers, and the skills gap is widening. The situation is complicated by the fact that there has historically not been a clearly established skill set or university degree that an individual could acquire to qualify as a data scientist. Companies seeking to hire their first data scientists often have little idea what credentials to look for in a candidate.

Cloudera Addresses Demand for Data Scientists through Training and Certification

As the global leader in Hadoop training and professional certification, Cloudera is addressing the widespread industry need for data scientists with its new CCP:DS certification. Designed and led by Cloudera's own elite team of data scientists, the CCP:DS program helps aspiring data scientists develop and prove out the skills they need to succeed with real-world enterprise data.

In addition to the certification exam, the program includes an optional three-day Introduction to Data Science course focused on teaching data professionals to build machine learning models and implement complex recommender systems with Hadoop as a platform using industry-standard tools like Python and Apache Mahout. Cloudera also offers a 60-question Data Science Essentials Practice Test for candidates to self-assess their exam-readiness, and a free Data Science Challenge Solution Kit consisting of a live data set, a step-by-step tutorial, and a detailed explanation of the processes required to arrive at the correct outcomes for real-world data science questions focused on classification, clustering, and collaborative filtering of web analytics.

Once candidates have passed the Data Science Essentials exam, they must then successfully complete a Cloudera Data Science Challenge, offered twice annually. By passing Cloudera's examination and live-data challenge, CCP:DS-credentialed individuals have demonstrated their ability to work with big data and build market-relevant data science models under real-world conditions at the very highest level. Cloudera Certified Professional: Data Scientist is the world's only certification that provides evidence of true experience and expertise developing a production-ready data science solution that is peer-evaluated for accuracy, scalability, and robustness.

Introducing the Data Science Challenge: Detecting Anomalies in Medicare Claims
Cloudera's second Data Science Challenge opens on March 31, 2014. Participants will have three months to complete the challenge. Designed by Cloudera's Director of Data Science, Sean Owen, the Data Science Challenge asks aspiring data scientists to detect possible errors and anomalies in Medicare claims using a massive set of anonymized healthcare data. Successful challengers will be able to answer questions, including:

  • Which medical procedures have the highest relative variance in cost?
  • Which three providers had the highest average amount claimed for the largest number of procedures?
  • Based on amount and type of procedures claimed, which three providers and regions are least like the others?
  • Identify 10,000 patients that seem most likely to need review for possible errors or anomalies. Describe some common features in these patients.

To learn more about the Data Science Challenge or to register, please visit: http://cloudera.com/content/cloudera/en/training/certification/ccp-ds/challenge/register.html

Join us for a webinar about the current Data Science Challenge on April 10: http://go.cloudera.com/LP=385

What Data Scientists Say about CCP:DS:
"The certification program that Cloudera has put together goes beyond the written test, including a challenge that is designed to assess the data scientist skills in much greater depth than could be achieved in a multiple choice questionnaire. From my perspective, this makes the exercise much more compelling, valuable, and meaningful than any other certification available today. You are actually solving problems through data analysis in a full simulation of situations data scientists face in the field."
- Luis Quintela, Samsung SDS, Cloudera Certified Professional: Data Scientist

"CCP:DS goes a long way towards removing ambiguity about who and what a data scientist is. Being associated with Cloudera earns instant respect, as well. Because the exam is based on real-world challenges and is fully vetted by some of the world's top experts, the certification does the hard work of pre-evaluating candidates against the multiple highly technical areas that would otherwise be difficult to qualify."
- David F. McCoy, confidential employer, Cloudera Certified Professional: Data Scientist

"I'm pumped to earn the CCP:DS credential! It holds true weight in the market because it replicates a real, sufficiently difficult big data scenario I would see on the job and requires a professional-level approach to solving problems. The exam captured all the relevant elements of data science and machine learning, and the challenge made the experience completely non-trivial."
- Stuart Horsman, Cloudera, Cloudera Certified Professional: Data Scientist

Learn More About Cloudera's Training and Professional Certification Programs
To learn more about Cloudera's comprehensive offering of big data training programs and professional certifications, including the new CCP: Data Scientist program, please visit:
http://university.cloudera.com.

About Cloudera
Cloudera is revolutionizing enterprise data management by offering the first unified Platform for Big Data, an enterprise data hub built on Apache Hadoop™. Cloudera offers enterprises one place to store, process and analyze all their data, empowering them to extend the value of existing investments while enabling fundamental new ways to derive value from their data. Only Cloudera offers everything needed on a journey to an enterprise data hub, including software for business critical data challenges such as storage, access, management, analysis, security and search. As the leading educator of Hadoop professionals, Cloudera has trained over 20,000 individuals worldwide. Over 900 partners and a seasoned professional services team help deliver greater time to value. Finally, only Cloudera provides proactive and predictive support to run an enterprise data hub with confidence. Leading organizations in every industry plus top public sector organizations globally run Cloudera in production.
www.cloudera.com

Connect with Cloudera
Read our Vision blog: http://vision.cloudera.com
Follow Cloudera on Twitter: http://twitter.com/cloudera
Follow Cloudera University on Twitter: http://twitter.com/ClouderaU
Visit us on Facebook: http://www.facebook.com/cloudera

Cloudera, Cloudera Platform for Big Data, Cloudera Enterprise Basic Edition, Cloudera Enterprise Flex Edition, Cloudera Enterprise Data Hub Edition and CDH are trademarks or registered trademarks of Cloudera in the United States and in jurisdictions throughout the world. All other company and product names may be trade names or trademarks of their respective owners.

Add to Digg Bookmark with del.icio.us Add to Newsvine

More Stories By Marketwired .

Copyright © 2009 Marketwired. All rights reserved. All the news releases provided by Marketwired are copyrighted. Any forms of copying other than an individual user's personal reference without express written permission is prohibited. Further distribution of these materials is strictly forbidden, including but not limited to, posting, emailing, faxing, archiving in a public database, redistributing via a computer network or in a printed form.

@ThingsExpo Stories
How do APIs and IoT relate? The answer is not as simple as merely adding an API on top of a dumb device, but rather about understanding the architectural patterns for implementing an IoT fabric. There are typically two or three trends: Exposing the device to a management framework Exposing that management framework to a business centric logic Exposing that business layer and data to end users. This last trend is the IoT stack, which involves a new shift in the separation of what stuff happens, where data lives and where the interface lies. For instance, it's a mix of architectural styles ...
Connected devices and the Internet of Things are getting significant momentum in 2014. In his session at Internet of @ThingsExpo, Jim Hunter, Chief Scientist & Technology Evangelist at Greenwave Systems, examined three key elements that together will drive mass adoption of the IoT before the end of 2015. The first element is the recent advent of robust open source protocols (like AllJoyn and WebRTC) that facilitate M2M communication. The second is broad availability of flexible, cost-effective storage designed to handle the massive surge in back-end data in a world where timely analytics is e...
The Internet of Things will put IT to its ultimate test by creating infinite new opportunities to digitize products and services, generate and analyze new data to improve customer satisfaction, and discover new ways to gain a competitive advantage across nearly every industry. In order to help corporate business units to capitalize on the rapidly evolving IoT opportunities, IT must stand up to a new set of challenges. In his session at @ThingsExpo, Jeff Kaplan, Managing Director of THINKstrategies, will examine why IT must finally fulfill its role in support of its SBUs or face a new round of...
We are reaching the end of the beginning with WebRTC, and real systems using this technology have begun to appear. One challenge that faces every WebRTC deployment (in some form or another) is identity management. For example, if you have an existing service – possibly built on a variety of different PaaS/SaaS offerings – and you want to add real-time communications you are faced with a challenge relating to user management, authentication, authorization, and validation. Service providers will want to use their existing identities, but these will have credentials already that are (hopefully) i...
Cultural, regulatory, environmental, political and economic (CREPE) conditions over the past decade are creating cross-industry solution spaces that require processes and technologies from both the Internet of Things (IoT), and Data Management and Analytics (DMA). These solution spaces are evolving into Sensor Analytics Ecosystems (SAE) that represent significant new opportunities for organizations of all types. Public Utilities throughout the world, providing electricity, natural gas and water, are pursuing SmartGrid initiatives that represent one of the more mature examples of SAE. We have s...
"Matrix is an ambitious open standard and implementation that's set up to break down the fragmentation problems that exist in IP messaging and VoIP communication," explained John Woolf, Technical Evangelist at Matrix, in this SYS-CON.tv interview at @ThingsExpo, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
The Internet of Things will greatly expand the opportunities for data collection and new business models driven off of that data. In her session at @ThingsExpo, Esmeralda Swartz, CMO of MetraTech, discussed how for this to be effective you not only need to have infrastructure and operational models capable of utilizing this new phenomenon, but increasingly service providers will need to convince a skeptical public to participate. Get ready to show them the money!
One of the biggest challenges when developing connected devices is identifying user value and delivering it through successful user experiences. In his session at Internet of @ThingsExpo, Mike Kuniavsky, Principal Scientist, Innovation Services at PARC, described an IoT-specific approach to user experience design that combines approaches from interaction design, industrial design and service design to create experiences that go beyond simple connected gadgets to create lasting, multi-device experiences grounded in people's real needs and desires.
P2P RTC will impact the landscape of communications, shifting from traditional telephony style communications models to OTT (Over-The-Top) cloud assisted & PaaS (Platform as a Service) communication services. The P2P shift will impact many areas of our lives, from mobile communication, human interactive web services, RTC and telephony infrastructure, user federation, security and privacy implications, business costs, and scalability. In his session at @ThingsExpo, Robin Raymond, Chief Architect at Hookflash, will walk through the shifting landscape of traditional telephone and voice services ...
Scott Jenson leads a project called The Physical Web within the Chrome team at Google. Project members are working to take the scalability and openness of the web and use it to talk to the exponentially exploding range of smart devices. Nearly every company today working on the IoT comes up with the same basic solution: use my server and you'll be fine. But if we really believe there will be trillions of these devices, that just can't scale. We need a system that is open a scalable and by using the URL as a basic building block, we open this up and get the same resilience that the web enjoys.
The Internet of Things is tied together with a thin strand that is known as time. Coincidentally, at the core of nearly all data analytics is a timestamp. When working with time series data there are a few core principles that everyone should consider, especially across datasets where time is the common boundary. In his session at Internet of @ThingsExpo, Jim Scott, Director of Enterprise Strategy & Architecture at MapR Technologies, discussed single-value, geo-spatial, and log time series data. By focusing on enterprise applications and the data center, he will use OpenTSDB as an example t...
The Domain Name Service (DNS) is one of the most important components in networking infrastructure, enabling users and services to access applications by translating URLs (names) into IP addresses (numbers). Because every icon and URL and all embedded content on a website requires a DNS lookup loading complex sites necessitates hundreds of DNS queries. In addition, as more internet-enabled ‘Things' get connected, people will rely on DNS to name and find their fridges, toasters and toilets. According to a recent IDG Research Services Survey this rate of traffic will only grow. What's driving t...
Enthusiasm for the Internet of Things has reached an all-time high. In 2013 alone, venture capitalists spent more than $1 billion dollars investing in the IoT space. With "smart" appliances and devices, IoT covers wearable smart devices, cloud services to hardware companies. Nest, a Google company, detects temperatures inside homes and automatically adjusts it by tracking its user's habit. These technologies are quickly developing and with it come challenges such as bridging infrastructure gaps, abiding by privacy concerns and making the concept a reality. These challenges can't be addressed w...
Explosive growth in connected devices. Enormous amounts of data for collection and analysis. Critical use of data for split-second decision making and actionable information. All three are factors in making the Internet of Things a reality. Yet, any one factor would have an IT organization pondering its infrastructure strategy. How should your organization enhance its IT framework to enable an Internet of Things implementation? In his session at Internet of @ThingsExpo, James Kirkland, Chief Architect for the Internet of Things and Intelligent Systems at Red Hat, described how to revolutioniz...
Bit6 today issued a challenge to the technology community implementing Web Real Time Communication (WebRTC). To leap beyond WebRTC’s significant limitations and fully leverage its underlying value to accelerate innovation, application developers need to consider the entire communications ecosystem.
The definition of IoT is not new, in fact it’s been around for over a decade. What has changed is the public's awareness that the technology we use on a daily basis has caught up on the vision of an always on, always connected world. If you look into the details of what comprises the IoT, you’ll see that it includes everything from cloud computing, Big Data analytics, “Things,” Web communication, applications, network, storage, etc. It is essentially including everything connected online from hardware to software, or as we like to say, it’s an Internet of many different things. The difference ...
Cloud Expo 2014 TV commercials will feature @ThingsExpo, which was launched in June, 2014 at New York City's Javits Center as the largest 'Internet of Things' event in the world.
SYS-CON Events announced today that Windstream, a leading provider of advanced network and cloud communications, has been named “Silver Sponsor” of SYS-CON's 16th International Cloud Expo®, which will take place on June 9–11, 2015, at the Javits Center in New York, NY. Windstream (Nasdaq: WIN), a FORTUNE 500 and S&P 500 company, is a leading provider of advanced network communications, including cloud computing and managed services, to businesses nationwide. The company also offers broadband, phone and digital TV services to consumers primarily in rural areas.
"There is a natural synchronization between the business models, the IoT is there to support ,” explained Brendan O'Brien, Co-founder and Chief Architect of Aria Systems, in this SYS-CON.tv interview at the 15th International Cloud Expo®, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
The major cloud platforms defy a simple, side-by-side analysis. Each of the major IaaS public-cloud platforms offers their own unique strengths and functionality. Options for on-site private cloud are diverse as well, and must be designed and deployed while taking existing legacy architecture and infrastructure into account. Then the reality is that most enterprises are embarking on a hybrid cloud strategy and programs. In this Power Panel at 15th Cloud Expo (http://www.CloudComputingExpo.com), moderated by Ashar Baig, Research Director, Cloud, at Gigaom Research, Nate Gordon, Director of T...