Apache Authors: Pat Romanski, John Mertic, Liz McMillan, Elizabeth White, Janakiram MSV

Blog Feed Post

Talend Simplifies Big Data Further with New Release of Enterprise Open Source Integration Platform

Open source software leader advances next-generation integration solution with big data profiling for Hadoop, support of major NoSQL databases and increased usability features

Maidenhead, UK - 5 November 2012 - Talend, a global open source software leader, today announced the availability of version 5.2 of its next-generation integration platform, the only offering that provides a unified environment for managing the entire lifecycle across data, application and process integration requirements. With version 5.2, Talend extends the industry's most flexible, scalable and adaptive integration platform with the introduction of key new capabilities, including big data profiling for Hadoop, support for widely-used and deployed NoSQL databases, and a set of improvements that increases product usability and performance across the entire platform.

Big Data Profiling
In its mission to democratise big data, Talend has focused extensively on solutions that make deploying and managing Apache Hadoop and related technologies simple, without requiring specific expertise in these areas. With version 5.2, Talend has taken its big data strategy a step further by adding big data profiling for Hadoop, providing companies with the ability to discover and understand data in Hadoop clusters. Among the typical problems associated with data quality are duplication, incompleteness and inconsistency, which create inefficiencies in data processing. Talend Platform for Big Data includes new capabilities for visibility into big data in all its forms and locations. These include the ability to analyse data in Hive databases on Hadoop "in place" without extraction and the ability to perform data hygiene tasks, including data cleansing, enrichment, matching and de-duplication directly inside the Hadoop cluster through Hadoop code generation.

Simplified NoSQL Integration with Hadoop
Talend 5.2 adds support for NoSQL databases in its integration solutions, Talend Platform for Big Data and Talend Open Studio for Big Data, with an initial set of connectors for Cassandra, HBase and MongoDB. Built on Talend's award-winning open source integration technology, Talend Open Studio for Big Data is a powerful and versatile open source solution for big data integration that natively supports Apache Hadoop, including connectors for Hadoop Distributed File System (HDFS), HCatalog, Hive, Oozie, Pig and Sqoop - in addition to the more than 450 connectors included natively in the product. As NoSQL has become the go-to technology for certain data architectures, the integration of these platforms into Talend's big data solution enables customers to use these new connectors to migrate and synchronise data between NoSQL databases and all other data stores and systems.

"Talend version 5.2 delivers on our vision of simplifying the development, integration and management of big data so that businesses can focus on using that data to make faster and more informed decisions," said Fabrice Bonan, co-founder and chief technical officer, Talend. "We provide the most powerful and versatile open source, big data solution to help organisations load, extract and improve disparate data while leveraging the massively parallel processing power of big data technologies including Apache Hadoop and leading NoSQL databases."

Latest Release of Talend's Integration Products
In addition to Talend's big data enhancements, Talend introduces version 5.2 of its flagship data integration products that leverage the Talend Unified Platform. New features focus on product usability, user productivity improvements and performance to provide a more robust and easier to use solution.

  • Talend Enterprise Data Integration - In v5.2, parallel execution of jobs can now leverage multi-core hardware. This new version also supports continuous integration between development, test and production environments and is integrated with open source build manager Maven.

  • Talend Enterprise Data Quality - Version 5.2 includes expanded address validation algorithms, precise e-mail validity detection, and native fraud detection capabilities. A new set of components allows customers to use Melissa Data to validate addresses.

  • Talend Enterprise MDM - Support for a wider range of enterprise architectures in v5.2 lowers the barrier to MDM adoption; organisations can now use their Oracle, MySQL, Derby or H2 databases as the underlying MDM data store.

  • Talend Enterprise ESB - In this version, Continuous Integration between development, test and production environments is now available. Version control system Nexus is also supported for versioning and deployment.

  • Talend Enterprise BPM - Talend v5.2 presents a fully integrated BPM engine into the Talend Runtime. Talend users only need to manage a single container, which can run data jobs, web services, REST applications and now BPM processes. With fewer moving parts in system environments and the flexibility to run multiple instances of different application types within the same container, the work of the IT administrator is significantly reduced in terms of management and maintenance of the software.

Version 5.2 of Talend Open Studio for Data Integration, Talend Open Studio for Data Quality, Talend Open Studio for MDM, Talend Open Studio for ESB and Talend Open Studio for Big Data are available for immediate download from Talend's web site www.talend.com/. Version 5.2 of the commercial subscription products, available before the end of 2012, will be provided to all existing Talend customers as part of their subscription agreement and can be procured through the usual Talend representatives or partners.

About Talend
Talend is the recognised market leader in open source integration solutions. The company's enterprise integration platform helps organisations minimise costs and maximise the value of data integration, ETL, data quality, master data management, application integration and business process management, while supporting their shift toward the Cloud and Big Data. More than 3,500 paying customers worldwide, including eBay, ING, The Weather Channel, Deutsche Post and Allianz, subscribe to Talend's solutions and services. With over 20 million downloads, Talend's products are the most trusted integration solutions in the world. The company has major offices in North America, Europe and Asia, and a global network of technical and services partners. For more information, please visit http://www.talend.com/.


PR Contacts:
Selene Regan
[email protected]

Tom Webb
01252 727313
[email protected]

Read the original blog entry...

More Stories By RealWire News Distribution

RealWire is a global news release distribution service specialising in the online media. The RealWire approach focuses on delivering relevant content to the receivers of our client's news releases. As we know that it is only through delivering relevance, that influence can ever be achieved.

@ThingsExpo Stories
SYS-CON Events announced today that Transparent Cloud Computing (T-Cloud) Consortium will exhibit at the 19th International Cloud Expo®, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. The Transparent Cloud Computing Consortium (T-Cloud Consortium) will conduct research activities into changes in the computing model as a result of collaboration between "device" and "cloud" and the creation of new value and markets through organic data proces...
In the next five to ten years, millions, if not billions of things will become smarter. This smartness goes beyond connected things in our homes like the fridge, thermostat and fancy lighting, and into heavily regulated industries including aerospace, pharmaceutical/medical devices and energy. “Smartness” will embed itself within individual products that are part of our daily lives. We will engage with smart products - learning from them, informing them, and communicating with them. Smart produc...
SYS-CON Events announced today that Enzu will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Enzu’s mission is to be the leading provider of enterprise cloud solutions worldwide. Enzu enables online businesses to use its IT infrastructure to their competitive advantage. By offering a suite of proven hosting and management services, Enzu wants companies to focus on the core of their online busine...
WebRTC adoption has generated a wave of creative uses of communications and collaboration through websites, sales apps, customer care and business applications. As WebRTC has become more mainstream it has evolved to use cases beyond the original peer-to-peer case, which has led to a repeating requirement for interoperability with existing infrastructures. In his session at @ThingsExpo, Graham Holt, Executive Vice President of Daitan Group, will cover implementation examples that have enabled ea...
SYS-CON Events announced today that Coalfire will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Coalfire is the trusted leader in cybersecurity risk management and compliance services. Coalfire integrates advisory and technical assessments and recommendations to the corporate directors, executives, boards, and IT organizations for global brands and organizations in the technology, cloud, health...
In past @ThingsExpo presentations, Joseph di Paolantonio has explored how various Internet of Things (IoT) and data management and analytics (DMA) solution spaces will come together as sensor analytics ecosystems. This year, in his session at @ThingsExpo, Joseph di Paolantonio from DataArchon, will be adding the numerous Transportation areas, from autonomous vehicles to “Uber for containers.” While IoT data in any one area of Transportation will have a huge impact in that area, combining sensor...
November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Penta Security is a leading vendor for data security solutions, including its encryption solution, D’Amo. By using FPE technology, D’Amo allows for the implementation of encryption technology to sensitive data fields without modification to schema in the database environment. With businesses having their data become increasingly more complicated in their mission-critical applications (such as ERP, CRM, HRM), continued ...
SYS-CON Events announced today that Cloudbric, a leading website security provider, will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Cloudbric is an elite full service website protection solution specifically designed for IT novices, entrepreneurs, and small and medium businesses. First launched in 2015, Cloudbric is based on the enterprise level Web Application Firewall by Penta Security Sys...
WebRTC sits at the intersection between VoIP and the Web. As such, it poses some interesting challenges for those developing services on top of it, but also for those who need to test and monitor these services. In his session at WebRTC Summit, Tsahi Levent-Levi, co-founder of testRTC, reviewed the various challenges posed by WebRTC when it comes to testing and monitoring and on ways to overcome them.
"Matrix is an ambitious open standard and implementation that's set up to break down the fragmentation problems that exist in IP messaging and VoIP communication," explained John Woolf, Technical Evangelist at Matrix, in this SYS-CON.tv interview at @ThingsExpo, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
DevOps is being widely accepted (if not fully adopted) as essential in enterprise IT. But as Enterprise DevOps gains maturity, expands scope, and increases velocity, the need for data-driven decisions across teams becomes more acute. DevOps teams in any modern business must wrangle the ‘digital exhaust’ from the delivery toolchain, "pervasive" and "cognitive" computing, APIs and services, mobile devices and applications, the Internet of Things, and now even blockchain. In this power panel at @...
In his general session at 18th Cloud Expo, Lee Atchison, Principal Cloud Architect and Advocate at New Relic, discussed cloud as a ‘better data center’ and how it adds new capacity (faster) and improves application availability (redundancy). The cloud is a ‘Dynamic Tool for Dynamic Apps’ and resource allocation is an integral part of your application architecture, so use only the resources you need and allocate /de-allocate resources on the fly.
SYS-CON Events announced today that SoftNet Solutions will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. SoftNet Solutions specializes in Enterprise Solutions for Hadoop and Big Data. It offers customers the most open, robust, and value-conscious portfolio of solutions, services, and tools for the shortest route to success with Big Data. The unique differentiator is the ability to architect and ...
SYS-CON Events announced today that Embotics, the cloud automation company, will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Embotics is the cloud automation company for IT organizations and service providers that need to improve provisioning or enable self-service capabilities. With a relentless focus on delivering a premier user experience and unmatched customer support, Embotics is the fas...
SYS-CON Events announced today that MathFreeOn will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. MathFreeOn is Software as a Service (SaaS) used in Engineering and Math education. Write scripts and solve math problems online. MathFreeOn provides online courses for beginners or amateurs who have difficulties in writing scripts. In accordance with various mathematical topics, there are more tha...
SYS-CON Events announced today that Niagara Networks will exhibit at the 19th International Cloud Expo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. Niagara Networks offers the highest port-density systems, and the most complete Next-Generation Network Visibility systems including Network Packet Brokers, Bypass Switches, and Network TAPs.
The best way to leverage your Cloud Expo presence as a sponsor and exhibitor is to plan your news announcements around our events. The press covering Cloud Expo and @ThingsExpo will have access to these releases and will amplify your news announcements. More than two dozen Cloud companies either set deals at our shows or have announced their mergers and acquisitions at Cloud Expo. Product announcements during our show provide your company with the most reach through our targeted audiences.
Virgil consists of an open-source encryption library, which implements Cryptographic Message Syntax (CMS) and Elliptic Curve Integrated Encryption Scheme (ECIES) (including RSA schema), a Key Management API, and a cloud-based Key Management Service (Virgil Keys). The Virgil Keys Service consists of a public key service and a private key escrow service. 

OnProcess Technology has announced it will be a featured speaker at @ThingsExpo, taking place November 1 - 3, 2016, in Santa Clara, California. Dan Gettens, OnProcess’ Chief Analytics Officer, will discuss how Internet of Things (IoT) data can be leveraged to predict product failures, improve uptime and slash costly inventory stock. @ThingsExpo is an annual gathering of IoT and cloud developers, practitioners and thought-leaders who exchange ideas and insights on topics ranging from Big Data in...
@ThingsExpo has been named the Top 5 Most Influential Internet of Things Brand by Onalytica in the ‘The Internet of Things Landscape 2015: Top 100 Individuals and Brands.' Onalytica analyzed Twitter conversations around the #IoT debate to uncover the most influential brands and individuals driving the conversation. Onalytica captured data from 56,224 users. The PageRank based methodology they use to extract influencers on a particular topic (tweets mentioning #InternetofThings or #IoT in this ...