Apache Authors: Carmen Gonzalez, Liz McMillan, Elizabeth White, Pat Romanski, Christopher Harrold

News Feed Item

Version 9 of Sinequa ES Now Available

Sinequa, a leading real-time big data search & analytics software company, announced today that they have released Sinequa ES Version 9, a major new version of its solution software.

Sinequa ES Version 9 offers a whole range of new functionalities, above all the integration and use of Hadoop, as well as many features inspired by the most innovative ideas from recent customer projects. The Sinequa R&D team has continued to develop new connectors to data sources, now at a total of 140 (including PTC Windchill, Mongo DB, Scality, Office 365, box…), to refine language analysis in the 19 languages covered by Sinequa, specifically in Asian languages including Chinese, Japanese and Korean. Furthermore, Version 9 offers advanced geo-location functions, tying products or people to locations and taking distance into account in relevance rankings. It also integrates with Amazon Web Services (AWS) such that customers can benefit from an “elastic” Sinequa grid hosted on the Amazon cloud and from certain services specific to AWS. The “elasticity” of Sinequa on AWS lets customers instantly scale computing resources to their requirements at each moment in time, be it for the indexing a large new data source or when adding a large number of users.

“The release of Sinequa ES Version 9 is a major step forward, in particular through the Hadoop / Mahout integration, that re-enforces our position in the big data arena by offering automatic classification and clustering, recommendations, and predictive analysis,” explains Alexandre Bilger, CEO of Sinequa. “Our work on Hadoop does not signal any change concerning our positioning in real-time search & analytics. Our users expect and demand real-time service. With the new version we have further enriched our index, even created a “hyper index” by indexing our index, in order to provide users with more concentrated information in real time, while simplifying user interfaces for end users and administrators alike.”

Sinequa ES V9 offers Hadoop integration on three different levels:

The Hadoop File System HDFS can be accessed as a data source via a new Sinequa connector.

Bi-directional Hadoop integration: Sinequa ES V9 can index data from Hadoop, but additionally, the Sinequa index can also be accessed by Hadoop for “typical” Hadoop processing: The calculation of relevance rankings and recommendations, and for predictive analysis. Moreover, Sinequa ES V9 can use Hadoop calculations for “smart linguistic indexing” and “index re-composition” according to clever but compute-intensive algorithms making use of company- and trade-specific knowledge (dictionaries, ontologies, taxonomies, directories).

With Hadoop Mahout (machine learning) we can leverage algorithms for automatic classification, recommendations, and predictive analysis.

For automatic classification, users can provide a large corpus of already classified documents to Sinequa/Mahout and ask the system to classify new incoming documents “the same way”. If the system gets it wrong, users make corrections, and the system will refine its classification by taking these corrections into account. This method of machine learning in classification is helpful when large amounts of classified/categorized documents already exist, while users find it difficult to express rules for this classification. These difficulties may come from different views on document sources by different professional communities. The existing classification serves as a “de facto classification method”.


A number of performance optimizations have been introduced in Sinequa ES V9, some of them the result of work in the most innovative customer projects where they have proved highly effective and useful.

Amongst these are Intelligent Caching mechanisms and hyper-indexing. They help Sinequa ES V9 overcome two serious challenges to real-time responses:

Some data sources have not been designed for fast data extraction on a massive scale. Sinequa ES V9 introduces a smart caching mechanism for data from such sources. This combines the advantages of search and “elastic” storage. Re-indexing with new analytic concepts is no longer hampered by slow data sources and offers “persistent insight”. Rules can be defined for the data to be extracted and to be refreshed in the cache.

Extracting relevant information for knowledge workers, such as scientists in the pharmaceutical industry, requires knowledge of synonyms and related topics in areas as diverse as diseases, genes, drugs, molecules, mechanisms of action, etc. Using a “shot gun approach” for finding such relevant information by simply launching as many queries as there are synonyms and related topics would never produce results at the required speed. That is why Sinequa builds a “semantically rich index” that uses company and trade knowledge of a subject domain in order to aggregate information on synonyms and related concepts.

In addition, creating a “hyper index” by indexing the “original” index, allows storing the complete “fingerprint” of a person extracted from a large corpus of documents. This fingerprint contains the areas of expertise of a person and the topics he or she has worked on over time. Consider it a “semantic join” of persons and topics that appear together in documents. The documents themselves are no longer part of the join, making the hyper-index very compact and its retrieval extremely fast. This allows a simple query (centered on just one topic) to deliver information on all semantically related topics, and on the best available experts on a given topic in just about one second, even when dealing with big data.

A pharmaceutical company like AstraZeneca can get a real-time view of the best team of experts on a research topic and related subjects (e.g. disease, genes, drugs, active molecules, mechanisms of action, lab tests, clinical tests, etc.) Users can even tweak the relevance of related subjects by moving sliders on their screens and see the “dream team” change in real time.

For more information on Sinequa ES V9, contact [email protected]

About Sinequa

Sinequa provides a real-time big data search & analytics platform for Fortune Global 2000 companies and government agencies. It offers users Unified Information Access to all textual and database data, supported by powerful analytics. Strong visualization enables intuitive and conversational discovery of actionable information. More than 250 of the world's largest and most information-intensive organizations rely on Sinequa to put business critical information at the fingertips of their employees, including Airbus, AstraZeneca, Atos, Biogen Idec, Credit Agricole, Mercer, and Siemens. Based in Paris, with offices in Frankfurt and London, Sinequa develops its expertise and its business across the world with an important network of technology and business partners. For more information visit http://www.sinequa.com/en/

More Stories By Business Wire

Copyright © 2009 Business Wire. All rights reserved. Republication or redistribution of Business Wire content is expressly prohibited without the prior written consent of Business Wire. Business Wire shall not be liable for any errors or delays in the content, or for any actions taken in reliance thereon.

IoT & Smart Cities Stories
The platform combines the strengths of Singtel's extensive, intelligent network capabilities with Microsoft's cloud expertise to create a unique solution that sets new standards for IoT applications," said Mr Diomedes Kastanis, Head of IoT at Singtel. "Our solution provides speed, transparency and flexibility, paving the way for a more pervasive use of IoT to accelerate enterprises' digitalisation efforts. AI-powered intelligent connectivity over Microsoft Azure will be the fastest connected pat...
There are many examples of disruption in consumer space – Uber disrupting the cab industry, Airbnb disrupting the hospitality industry and so on; but have you wondered who is disrupting support and operations? AISERA helps make businesses and customers successful by offering consumer-like user experience for support and operations. We have built the world’s first AI-driven IT / HR / Cloud / Customer Support and Operations solution.
Codete accelerates their clients growth through technological expertise and experience. Codite team works with organizations to meet the challenges that digitalization presents. Their clients include digital start-ups as well as established enterprises in the IT industry. To stay competitive in a highly innovative IT industry, strong R&D departments and bold spin-off initiatives is a must. Codete Data Science and Software Architects teams help corporate clients to stay up to date with the mod...
At CloudEXPO Silicon Valley, June 24-26, 2019, Digital Transformation (DX) is a major focus with expanded DevOpsSUMMIT and FinTechEXPO programs within the DXWorldEXPO agenda. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term. A total of 88% of Fortune 500 companies from a generation ago are now out of business. Only 12% still survive. Similar percentages are found throug...
Druva is the global leader in Cloud Data Protection and Management, delivering the industry's first data management-as-a-service solution that aggregates data from endpoints, servers and cloud applications and leverages the public cloud to offer a single pane of glass to enable data protection, governance and intelligence-dramatically increasing the availability and visibility of business critical information, while reducing the risk, cost and complexity of managing and protecting it. Druva's...
BMC has unmatched experience in IT management, supporting 92 of the Forbes Global 100, and earning recognition as an ITSM Gartner Magic Quadrant Leader for five years running. Our solutions offer speed, agility, and efficiency to tackle business challenges in the areas of service management, automation, operations, and the mainframe.
The Jevons Paradox suggests that when technological advances increase efficiency of a resource, it results in an overall increase in consumption. Writing on the increased use of coal as a result of technological improvements, 19th-century economist William Stanley Jevons found that these improvements led to the development of new ways to utilize coal. In his session at 19th Cloud Expo, Mark Thiele, Chief Strategy Officer for Apcera, compared the Jevons Paradox to modern-day enterprise IT, examin...
With 10 simultaneous tracks, keynotes, general sessions and targeted breakout classes, @CloudEXPO and DXWorldEXPO are two of the most important technology events of the year. Since its launch over eight years ago, @CloudEXPO and DXWorldEXPO have presented a rock star faculty as well as showcased hundreds of sponsors and exhibitors! In this blog post, we provide 7 tips on how, as part of our world-class faculty, you can deliver one of the most popular sessions at our events. But before reading...
DSR is a supplier of project management, consultancy services and IT solutions that increase effectiveness of a company's operations in the production sector. The company combines in-depth knowledge of international companies with expert knowledge utilising IT tools that support manufacturing and distribution processes. DSR ensures optimization and integration of internal processes which is necessary for companies to grow rapidly. The rapid growth is possible thanks, to specialized services an...
At CloudEXPO Silicon Valley, June 24-26, 2019, Digital Transformation (DX) is a major focus with expanded DevOpsSUMMIT and FinTechEXPO programs within the DXWorldEXPO agenda. Successful transformation requires a laser focus on being data-driven and on using all the tools available that enable transformation if they plan to survive over the long term. A total of 88% of Fortune 500 companies from a generation ago are now out of business. Only 12% still survive. Similar percentages are found throug...