AI factory 101: An AI-readiness assessment framework 

This blog explores how taking a factory-like approach can help your AI program achieve greater output and shares how you can assess your organization’s AI readiness.

This is the second part of a series of five “AI factory 101” posts in which Mark Palmer describes the elements of an AI Factory and how any organization can implement one to become a fast, agile, innovative, healthy factory of AI algorithms. Read part one here.

“Model T customers can have any color they want, as long as it’s black.” To some, Henry Ford’s take on Model T production was rigid, uninspired, and boring. Moreover, the Model T was painfully hard to start, slow, and had poor gas mileage.

However, Ford’s innovations were breathtaking to business leaders who looked under the hood. His approach to factory automation produced legendary breakthroughs. Build time was cut from 12 hours to 93 minutes, factory efficiencies helped slash prices by 70% – from $950 to $260, and output increased from 18,000 to 785,000 cars per year in just seven years. 

High-performing AI leaders use factory-inspired ideas today to achieve similar results. A 2024 McKinsey & Company report shows that high-performing firms that have adopted AI attribute up to 20% of their EBIT (earnings before interest and taxes) to their use of AI and are twice as likely to employ ‘AI Factory’ concepts as laggards: 42% versus 19%.

Let’s explore how AI Factory thinking can help you achieve Ford-like results and how to assess your organization’s AI readiness to find and fill gaps. 

Establishing AI readiness 

Harvard Business School professors Marco Iansiti and Karim Lakhani coined the term “AI Factory” in their book ‘Competing in the Age of AI‘. Their concept treats the creation and use of AI like a factory, spanning data processing, algorithm development, tooling, an “AI Lab” for experimentation, and production.

Instead of getting bogged down in a technical quagmire of algorithms, the notion of an AI factory helps leaders think about AI in systems terms: the raw materials, the experimentation, and the impact of AI on the business.

But how do you know if your organization is ready to adopt an AI factory approach? I would use the following assessment framework, which focuses on four key areas: data pipelines, product development, automation, and operations.

An AI Factory helps teams navigate an overgrown algorithmic jungle. Most organizations have too many models to choose from. For example, the AI model repository Hugging Face contains over 700,000 AI models with more being released every day. This vast number of options can confuse analysts rather than help them move more quickly.  

An AI Factory helps teams automate and simplify the selection and use of the right LLM services for the job at hand. Let’s explore how. 

Factory-like AI data pipelines 

All factories start with raw materials. For cars, that means steel, spark plugs, and rubber.

AI Factories start with raw materials too, in the form of data. New types of data. Specifically, unstructured data.  

I’d wager that most organizations have pipelines that weren’t designed to process interactions with customers, conversations, notes from salespeople, or social media sentiment. And that, until recently, most organizations have overlooked potential insights in audio, video, and images (safety monitoring based on video, drone footage for applications like crop management in agriculture, or computer-vision-aware automation and safety).

AI leaders know that GenAI puts new demands on data pipelines. McKinsey found that leaders are 2.5 times more likely (42% versus 17%) to incorporate new data into existing pipelines to power LLMs than other adopters. 

Here are five questions to ask to understand the AI readiness of your data pipelines:

1. Do you systematically engage business users to discuss what new forms of data could help them build innovative AI-based applications, and create a data roadmap to incorporate those new forms of data into existing or refined data pipelines?
2. Have you reimagined your data pipeline to fuel AI with new types of data, so that it can integrate, clean, label, transform, augment, optimize, and publish unstructured data, including text, audio, streaming video, images, and IoT sensor data?
3. Do you create vector embeddings in your data pipeline to process unstructured data, and use Retrieval-Augmented Generation (RAG) to combine large language models (LLMs) with existing vetted data to enhance the accuracy and relevance of GenAI responses? (See the sketch after this list.)
4. Have you extended your data governance policies to include GenAI, LLMs, and RAG, including checks for accuracy, bias, privacy, hallucinations, and fairness?
5. Is the data from these pipelines and the LLM applications that use it under a single version control system?
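To make question 3 concrete, here is a minimal RAG sketch in Python. It assumes the open-source sentence-transformers package and uses made-up documents; a production pipeline would pull vetted data from the pipeline itself and send the final prompt to a hosted LLM.

```python
# Minimal RAG sketch: embed documents, retrieve by cosine similarity,
# and assemble a grounded prompt for an LLM. Documents are illustrative.
import numpy as np
from sentence_transformers import SentenceTransformer

model = SentenceTransformer("all-MiniLM-L6-v2")

# Vetted, unstructured documents from the data pipeline (made up here).
docs = [
    "Q3 churn rose 4% among customers on the legacy pricing plan.",
    "Support tickets mentioning 'billing' doubled after the June release.",
    "NPS improved 12 points for accounts using the new onboarding flow.",
]
doc_vecs = model.encode(docs, normalize_embeddings=True)

def retrieve(query, k=2):
    """Return the k documents most similar to the query."""
    q = model.encode([query], normalize_embeddings=True)[0]
    scores = doc_vecs @ q  # cosine similarity on normalized vectors
    return [docs[i] for i in np.argsort(scores)[::-1][:k]]

question = "Why is churn increasing?"
context = "\n".join(retrieve(question))
prompt = f"Answer using only this context:\n{context}\n\nQuestion: {question}"
print(prompt)  # `prompt` is now ready to send to the LLM of your choice
```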

If you answered “no” or “I don’t know” to any of these questions, talk with your Chief Data Officer to understand your data pipeline and how it has been adapted for AI. Understanding the data you have at hand is the first step toward building an efficient AI Factory.

Factory-like AI development  

Factory leaders collaborate closely with the designers, engineers, and product managers whose products they produce.

AI Factories are no different. Research shows that 46% of GenAI leaders focus on systems of AI production using tools, procedures, and training that encourage collaboration, compared to 15% of the general population of adopters. These include quality assurance, documentation, usage policies, rigorous A/B testing, champion/challenger optimization, and a culture of killing models that either don’t work in practice or drift and become ineffective. 
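As a concrete illustration of the champion/challenger discipline mentioned above, here is a minimal sketch. The models, dataset, and promotion threshold are illustrative stand-ins, not a prescription.

```python
# Champion/challenger sketch: promote the challenger only if it beats the
# champion on a holdout set by a meaningful margin; otherwise keep the
# champion and kill the challenger. All choices below are illustrative.
from sklearn.datasets import make_classification
from sklearn.ensemble import GradientBoostingClassifier
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split

X, y = make_classification(n_samples=2000, random_state=0)
X_train, X_hold, y_train, y_hold = train_test_split(X, y, random_state=0)

champion = LogisticRegression(max_iter=1000).fit(X_train, y_train)
challenger = GradientBoostingClassifier(random_state=0).fit(X_train, y_train)

champ_acc = champion.score(X_hold, y_hold)
chall_acc = challenger.score(X_hold, y_hold)

PROMOTION_MARGIN = 0.01  # require a 1-point absolute improvement
if chall_acc - champ_acc >= PROMOTION_MARGIN:
    print(f"Promote challenger ({chall_acc:.3f} vs {champ_acc:.3f})")
else:
    print(f"Keep champion ({champ_acc:.3f}); kill challenger ({chall_acc:.3f})")
```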

Here are five questions to ask about how factory-like your AI development process is: 

1. Do you have a robust data science and machine learning platform that encourages cross-functional collaboration, automation, and co-pilot-style creation of fast-start code?
2. Do your processes include regular AI evaluation checkpoints to review AI performance, metrics, drift, efficacy, and cost, and to continuously refine them?
3. Do you have a customer feedback loop that captures feedback about whether AI is producing fair, ethical, and unbiased recommendations, actions, and observations?
4. Do you employ “Decision Observers” for AI (a term coined by Nobel Prize-winning behavioral economist Daniel Kahneman) who work with teams to identify ethical concerns and security issues?
5. Are your AI monitoring tools available not only to teams “inside” the factory (data science teams), but also to “customers” (the business users, customers, and teams that use AI’s output)?

Factory-like AI automation  

Modern manufacturing operations are automated. They employ robotics to perform tasks that humans either cannot do or that are too dangerous for people to perform. They automatically monitor production yield and quality to raise exceptions to human operators. This automation helps speed production, improve quality, and spark innovation.

The same is true for AI factories.  

In the case of AI, automation augments critical thinking and software development. Agents, co-pilots, and tooling help teams with code development, documentation generation, test coverage, and intelligent debugging. AI assists in data preparation, model optimization, and performance analysis. They can even help augment the user experience design process.  

Here are five questions to check how deeply you’ve embraced AI automation:

1. Do you use AI agents to generate code snippets, suggest completions, create test cases, perform unit and integration testing, and help identify and fix bugs faster based on natural language descriptions?
2. Do you use AI to fine-tune and optimize models, suggest improvements to architecture, choose hyperparameters, and select training data?
3. Do you use AI to clean, preprocess, and augment datasets for training GenAI models? (See the sketch after this list.)
4. Do you use AI agents to generate and maintain documentation for code, APIs, and user guides, and to check for security vulnerabilities as projects evolve?
5. Do you use AI to help design user interface mockups, suggest layouts, and predict user behavior?
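Here is the sketch promised in question 3: a toy clean/preprocess/augment step in pandas. The column names, outlier rule, and noise level are illustrative; in a real factory, an AI agent or profiling tool would propose them.

```python
# Sketch of an automated clean/preprocess/augment step. The data, the
# outlier rule, and the jitter amount are all illustrative.
import numpy as np
import pandas as pd

df = pd.DataFrame({
    "amount": [100.0, 102.5, np.nan, 98.0, 5000.0],  # 5000.0 is an outlier
    "label":  [0, 1, 1, 0, 1],
})

# Clean: impute missing values, then apply a domain rule for outliers.
df["amount"] = df["amount"].fillna(df["amount"].median())
df = df[df["amount"].between(0, 1_000)].copy()

# Augment: jitter numeric features with small Gaussian noise to enlarge
# the training set (a common tabular augmentation heuristic).
rng = np.random.default_rng(0)
augmented = df.copy()
augmented["amount"] += rng.normal(0, df["amount"].std() * 0.05, len(df))
train = pd.concat([df, augmented], ignore_index=True)
print(train)
```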

By leveraging AI in these ways, developers can focus on high-level design and creative problem-solving. This leads to faster development cycles, higher-quality code, and more innovative applications. 

Factory-like AI model operations and monitoring 

In manufacturing, increased automation places greater demands on monitoring and exception handling.  

Again, the same is true for AI factories, and research reveals that AI high performers are jaw-droppingly advanced in their emphasis on AI operations and monitoring. McKinsey found that GenAI leaders are almost six times as likely (41% versus 7%) to track, measure, and resolve AI model performance, data quality, infrastructure health, financial operations, and more.

Here are five questions to assess the AI-readiness of your operations and monitoring: 

1. Do you have a team that regularly evaluates the accuracy of your AI model’s predictions against ground truth data, helping identify performance degradation over time?
2. Do you monitor for data drift (changes in the input data distribution) and prediction drift (changes in the output distribution), and regularly check data for missing values, anomalies, and inconsistencies that could affect model performance? (See the monitoring sketch after this list.)
3. Do you implement alerts for significant changes in key metrics such as accuracy, precision, recall, and F1 score?
4. Do you have a regular feedback loop to continuously improve your model, using real-world data and user feedback to retrain and fine-tune it, ensuring it adapts to new patterns and trends?
5. Do you schedule regular retraining sessions for your model using updated data to help mitigate the effects of concept drift and ensure that the model remains accurate and relevant?
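Here is the monitoring sketch referenced in the list: drift detection with a two-sample Kolmogorov-Smirnov test (via scipy) plus a simple F1-score alert (via scikit-learn). The data and thresholds are synthetic stand-ins; tune them to your tolerance for false alarms.

```python
# Monitoring sketch for questions 2 and 3: detect input-data drift with a
# two-sample KS test and alert when F1 drops below a floor.
import numpy as np
from scipy.stats import ks_2samp
from sklearn.metrics import f1_score

rng = np.random.default_rng(0)
reference = rng.normal(0.0, 1.0, 5_000)   # feature at training time
live = rng.normal(0.4, 1.0, 5_000)        # same feature in production

stat, p_value = ks_2samp(reference, live)
if p_value < 0.01:
    print(f"ALERT: data drift detected (KS={stat:.3f}, p={p_value:.1e})")

y_true = rng.integers(0, 2, 1_000)
y_pred = rng.integers(0, 2, 1_000)        # stand-in for live predictions
f1 = f1_score(y_true, y_pred)
F1_FLOOR = 0.80
if f1 < F1_FLOOR:
    print(f"ALERT: F1 {f1:.3f} fell below the {F1_FLOOR} floor")
```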

These monitoring best practices of AI leaders help ensure your AI models remain robust, reliable, and capable of delivering accurate predictions in a dynamic real-world business environment. 

AI factories produce innovation, energy, and ROI 

To recap: assess your AI readiness by asking questions about your data pipelines, development processes, automation, and operations. By refining these elements, organizations can turn raw data into actionable insights, streamline production, and ensure accuracy and relevance.

High-performing companies already leveraging AI Factory principles report substantial benefits, such as attributing 10% to 20% of EBIT to AI, along with increased innovation. Keep the frameworks discussed in this blog in mind and, just like Ford, you’ll produce more AI output, drive down costs, and spark growth with AI that helps you lap the field on today’s digital racetrack.

For more information on what it takes to build an AI factory, read the first blog in the AI factory 101 series here. Discover why KDB.AI is a crucial part of the AI factory here. 

    Lessons you can learn from GenAI high performers

    Over the past ten months, some GenAI leaders have clearly distinguished themselves, creating substantial returns on investment and setting new benchmarks in their respective industries.

A new McKinsey & Company leadership survey highlights the impressive gains achieved by high-performing companies through the strategic use of GenAI. These companies can now attribute as much as 20% of their EBIT (earnings before interest and taxes) to GenAI applications, with 65% of organizations employing these technologies regularly, twice as many as ten months ago.

    But what differentiates these GenAI high performers from their counterparts?

    In this blog, I’ll explore the key practices that separate successful companies from the rest and discuss several strategies that leaders are adopting to effectively implement and operationalize GenAI to drive meaningful business value.

    Adopting a leaders’ mindset toward GenAI

    A notable commonality among high-performing GenAI companies is their robust foundation in more traditional AI. These leaders not only have well-established systems designed to extract insights, detect patterns, and support decision-making but also report deriving over 20% of their EBIT from such non-generative systems. The survey suggests that success with traditional AI forms a strong basis for excelling with GenAI, with high performers also reporting over 10% of EBIT attributed directly to GenAI.

    Their approach to GenAI also sets leaders apart. Companies at the forefront are more likely to develop proprietary AI models, seek unconventional data sources, and engage legal teams early in the process, unlike their counterparts who often rely on generic, out-of-the-box AI solutions.

Leading firms are also more likely to utilize GenAI extensively across various departments: not just in IT, sales, and marketing, but also in areas like risk management, legal, compliance, strategy, finance, supply chain, and inventory management.

    Here’s how you can adopt a ‘leader’s’ mindset towards GenAI:

    • Start with the Basics: Get to grips with traditional AI to build a foundational understanding of the value it can bring to your business. This will allow you to build a strong platform and business case for adopting GenAI.
• Customize Your Approach: Use the understanding of the value gained through analytical AI use cases to tailor models to fit your business requirements. Work with partners and platforms that help you customize optimized solutions (like KDB.AI, OpenAI GPT-4, Azure Cognitive Services, etc.) rather than relying on generic, out-of-the-box alternatives.
• Expand Usage: Encourage different departments, beyond just IT, sales, and marketing, to explore how GenAI can benefit their operations. GenAI isn’t just a time-saving and efficiency tool. It can be used for significant value-driving applications throughout your business.

    Leaders build AI factories

High performers are more likely than others to report experiencing challenges with their operating models, such as implementing agile ways of working and effective sprint performance management. Even so, high performers take a systematic approach, building what Marco Iansiti and Karim R. Lakhani (the authors of ‘Competing in the Age of AI’) call an “AI Factory”.

An AI Factory systematizes the culture, processes, and tools required to apply AI, from data pipelines through algorithm development to rigorous experimentation. Its success is reliant on the data feeding it.

Data is a top priority for organizations building their GenAI initiatives. AI-ready data needs to be well-structured, clean, and annotated to ensure accuracy and efficiency in GenAI applications. According to the McKinsey survey, 70% of leaders say they’ve experienced significant hurdles with data, far and away the top challenge faced by high performers (risk, at 48%, is a distant second).

    These data challenges are many, including:

    1. Establishing robust data governance: Define clear protocols for data usage and integration
    2. Enhancing data integration: Streamline the incorporation of new data into AI models for quicker, more effective results
    3. Addressing data scarcity: Develop strategies to augment training datasets to improve model accuracy

    By overcoming these challenges, leaders can reimagine how they process data, designing modern data pipelines as the first step in forming their AI factory:

    • Modernized data pipelines: These facilitate the collection and processing of new forms of data, especially unstructured data such as conversations with customers and sales prospects, images, and video. These are pivotal for training more sophisticated AI models.
    • Algorithm development: Fed by the data pipelines, this is where AI engineers and analysts work together to invent, fine-tune, and train new models, customize off-the-shelf models, or, in some cases, use what already works. Again, high performers are more likely to customize or build entirely new AI models than laggards, who tend to use off-the-shelf algorithms without customization.
    • Rigorous experimentation: Experimentation is how high performers establish protocols to assess and mitigate AI risks and biases. This involves including legal and compliance teams early in the development process to evaluate privacy, regulatory, or IP concerns.

    By adopting an “AI factory” mindset and combining it with a highly performant data analytics tech stack, you can innovate faster, foster continuous improvement, and manage risks effectively.

    Learning from high performers

    The high-performing GenAI companies have begun to separate from the pack. They’re moving beyond pilots to drive significant business outcomes. But it’s still early – just 5% of over 800 respondents are in the lead, and it’s not too late to catch up.

    To become a leader:

    1. Start with a strong foundation in traditional AI
    2. Engage more business functions
    3. Address data challenges early and systematically
4. Adopt an “AI Factory” approach and develop proprietary models
    5. Reimagine data processes to build a robust AI infrastructure

Apply these lessons to innovate faster, foster continuous improvement, and manage risks, and you’ll be in the lead pack in no time at all.

    Read more about what it takes to set up an AI factory in this blog: AI Factory 101.

    Webinar: Next-gen analytics in capital markets with NVIDIA

    In a recent webinar, hosted by Prabhu Ramamoorthy from NVIDIA, our CEO, Ashok Reddy, joined finance industry leaders to discuss the transformative impact of next-gen analytics and AI on capital markets.  

The session provided valuable insights for capital market firms striving to become AI-first enterprises [1]. It covered how leaders can use AI to create differentiation, improving trade research (ideation), trade execution, and risk management at a price-to-performance that makes sense.

     Watch the full session or browse the key takeaways below. 

    Key takeaways

    Take a holistic approach to AI 

The session explored the opportunity of becoming an “AI-first” company to unlock differentiation. While generative AI has caused significant buzz, the true value lies in leveraging the full spectrum of AI capabilities, including traditional, data-driven AI. This holistic approach, combined with the utilization of all available data sources, structured and unstructured, enables firms to unlock deeper insights and drive innovation.

    Ensure you have AI-ready data 

    AI-ready data, which is relevant, representative, trustworthy, bias-free, clean, and in vector form, is crucial for training accurate and efficient AI models. This high-quality data enables firms to build reliable AI applications that deliver tangible business value. 

    Manage risk and compliance with AI 

    Explainable AI models are crucial for transparency and auditability, mitigating risks associated with black-box AI systems. AI can also be instrumental in proactively identifying and alleviating risks. 

    Focus on high-performing technology for maximum ROI 

To maximize ROI and generate enterprise value, firms should focus on high-performance technologies. These technologies not only enable greater efficiency but also accelerate model development, ensuring faster time-to-market and more accurate, reliable AI solutions. By leveraging optimized hardware and software, organizations can minimize hallucinations and drive innovation through real-time analytics, advanced AI/ML capabilities, and accelerated processing, delivering tangible business results that go beyond the limitations of early, generative-AI-only offerings.

    Build ‘AI factories’

An ‘AI factory’ is a model for adopting AI that systematizes the culture, processes, and tools required to apply AI to automate processes, improve decision making, and differentiate your services. It involves setting up data pipelines to process both structured data (transaction, market, and reference data, etc.) and unstructured data (analyst reports, audio, video, news articles, etc.). You then use that data to develop and train advanced, accurate models with continuous experimentation. This allows you to build world-class analytics, search, and recommendation applications.

    Accelerate AI with NVIDIA’s Grace Hopper superchip 

Using a combination of KX and NVIDIA’s Grace Hopper superchip enables accelerated AI computing. The webinar highlights impressive results from a recent KX NVIDIA AI Labs project in which we processed 2 million documents in 80 minutes, versus the 40 hours it took with other technologies.

These results are achieved by optimizing CPU/GPU usage, combined with the processing efficiency of KX, which vastly reduces overall memory and energy consumption. This enables faster AI model training and deployment. The integration of KX and NVIDIA technologies ensures efficient and sustainable AI solutions.

    Identify patterns in market data 

    By analyzing the order book and balance, AI can reveal correlations and confounders that impact market dynamics. Advanced AI techniques, such as graph networks and attention models, can reveal hidden patterns in market data to understand liquidity and enhance market efficiency. 

    Next steps

    The webinar provided valuable insights into the transformative potential of AI in capital markets. By adopting a holistic AI approach, leveraging high-performance technologies, ensuring data readiness, and collaborating with industry leaders, firms can navigate the complex landscape of modern finance and unlock new opportunities for growth and innovation. 

    KX and NVIDIA are committed to supporting firms in their journey to become AI-first enterprises. They offer a joint AI Factory Lab as a free service for customers to develop their own differentiated use cases, providing access to expertise, tools, and resources. To learn more and register your interest in the AI Lab, click here. 

[1] An AI-first enterprise is an organization that strategically prioritizes artificial intelligence (AI) throughout its operations, making AI a core component of its culture, infrastructure, and decision-making processes to drive innovation and competitive advantage.


      Hybrid data: The key to unlocking generative AI accuracy 

      New research shows that using semantically structured data with generative AI makes answers three times more accurate than a Large Language Model (LLM) with SQL. Unfortunately, traditional databases and LLMs aren’t designed to work together, and building a bridge over that divide is hard. 

      Vector databases have emerged to fill that gap. But bolt-on vector search isn’t enough to improve query accuracy, simplify prompt engineering, or make building GenAI applications more cost-effective. A better solution is a hybrid approach that uses data as guide rails for accuracy. 

      Here’s a fresh look at how hybrid data computing helps deliver more accurate GenAI applications. 

      What is hybrid data, and why does it matter? 

      Hybrid computing is a style of building applications that fuses unstructured and structured data to capture, organize, and process data. Sounds simple, right?  

      It’s not.  

      Structured and unstructured data have different characteristics: structured data is carefully curated, accurate, and secure. Unstructured data and LLMs are used for serendipitous exploration. Combining them is an exercise left to the developer.  

A recent MIT study showed that using generative AI with unstructured data increased speed-to-insight by 44% and quality by 20%. The study examined nontrivial critical-thinking tasks performed by data scientists, such as planning and exploring a data set. It revealed that unstructured data helps analysts, data scientists, HR leaders, and executives make and communicate better decisions.

      However, the study also showed that those LLM-generated answers were often wrong, and 68% of participants didn’t bother to check the answers. For GenAI, inaccuracies are partly by design: neural networks mimic how our brains work. Like humans, they make mistakes, hallucinate, and interpret questions incorrectly.  

      For most enterprise applications, such inaccuracies are unacceptable. Hybrid computing can help guide LLMs to accurate, secure, reliable, creative, and serendipitous responses. But how do you create these hybrid computing systems that leverage the best of both worlds?  

      Three elements of hybrid data computing 

      A hybrid computing system has three elements: 

1. Hybrid query processing
2. Semantic data layer
3. Hybrid data indexing

      Let’s explore each. 

      Hybrid query processing 

      Hybrid data computing aims to use structured and unstructured data as a single organism to provide accurate responses to natural language queries. Using structured data to guide LLM responses is the first element of a hybrid model.  

Our vector database, KDB.AI, works with structured and unstructured data to find similar data across time and meaning, and to extend the knowledge of Large Language Models.

For example, imagine our application answers prompts like “Why did my portfolio decline in value?” Using an LLM by itself yields a generic answer: market volatility, company news, currency fluctuations, or interest rate changes might be the source of a dip in your portfolio’s value.

      Combining structured and unstructured data marries accuracy with serendipitous discovery. It provides more expansive and specific insights.

But we don’t want a generic LLM-generated answer – any public LLM can give us that. We want to know why MY portfolio declined and how MY performance relates to similar portfolios. That answer is born by fusing structured data, similarity search, and LLM data in one hybrid system.

The first stage of answering this query is to examine the portfolio we’re interested in and compare it to similar portfolios. To do this, we find the account ID with a standard relational database lookup. Then, we use vector embeddings to find portfolios similar to our own.

Once that context is established, the hybrid system forwards the data to our LLM to formulate a language-based answer. In this way, hybrid query processing combines accuracy, similarity, and prompt-based answers to provide GenAI answers powered by structured data.
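Here is a minimal sketch of that three-stage flow, using Python’s built-in sqlite3 for the structured lookup and numpy for the similarity step. The schema, data, and random “embeddings” are illustrative; a production system would use a vector database such as KDB.AI.

```python
# Hybrid query sketch: a relational lookup pins down MY portfolio, a
# vector search finds similar ones, and both feed the LLM prompt.
import sqlite3
import numpy as np

con = sqlite3.connect(":memory:")
con.execute("CREATE TABLE portfolios (id TEXT, owner TEXT, ytd_return REAL)")
con.executemany("INSERT INTO portfolios VALUES (?, ?, ?)", [
    ("P1", "me", -0.042), ("P2", "alice", -0.038), ("P3", "bob", 0.051),
])

# Stage 1: structured lookup (standard SQL).
my_id, my_return = con.execute(
    "SELECT id, ytd_return FROM portfolios WHERE owner = 'me'").fetchone()

# Stage 2: similarity search over portfolio embeddings (precomputed by an
# embedding model in a real system; random vectors here for illustration).
rng = np.random.default_rng(0)
embeddings = {pid: rng.normal(size=8) for pid in ("P1", "P2", "P3")}
q = embeddings[my_id]
scores = {pid: float(v @ q / (np.linalg.norm(v) * np.linalg.norm(q)))
          for pid, v in embeddings.items() if pid != my_id}
nearest = max(scores, key=scores.get)

# Stage 3: hand the fused context to the LLM.
prompt = (f"My portfolio {my_id} returned {my_return:.1%} YTD. The most "
          f"similar portfolio is {nearest}. Explain the likely drivers.")
print(prompt)
```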

      Semantic data layer 

When answering prompts with structured data, the software must understand what data it has on hand to query, its relationships to other data, and how to retrieve and combine it before passing that guidance to the LLM to answer questions. For that, we need a semantic data layer: a roadmap to the structured data the system has at hand. New research from data.world showed that LLM systems with this semantic understanding of data are three times more accurate than those without (54% versus 16% accuracy).

      A semantic data layer is like a data treasure map that describes the semantics of the business domain and the physical database schema in a knowledge graph. It can include synonyms and labels not expressible in SQL to help convert natural language queries into the right SQL queries needed to satisfy the query. 

      Researchers argue that this context must be treated as a first-class citizen, managed with metadata like a master data management (MDM) tool or data catalog, or ideally in a knowledge graph architecture. Otherwise, the crucial context that provides accuracy would need to be managed ad hoc. 

For example, to understand the relationships between an insurance claim, its policy payment, premiums, and profit and loss, you need to know how those data elements are related in a knowledge graph. Analysts and programmers traverse this graph to answer questions like, “What is the payout and cost of claim XYZ?” Once they find the claim and its associated policy, they can then understand payout, expenses, and reserves.

      For our portfolio analysis question, a semantic data layer could help our hybrid query system understand the relationship between your portfolio and others, including the investment sector, portfolio size, risk, and other factors described in the semantic data layer. This helps ensure we have the right context and meaning in the data we provide the LLM. 
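Here is a toy illustration of the idea: a few concepts, synonyms, and schema mappings held in a dictionary. Real semantic layers live in data catalogs or knowledge graphs; every name below is made up.

```python
# Toy semantic data layer: business terms, synonyms, and the physical
# schema they map to, used to steer natural-language questions toward
# the right SQL. All concepts, tables, and queries are illustrative.
SEMANTIC_LAYER = {
    "payout": {
        "synonyms": {"payment", "claim payment", "settlement"},
        "sql": "SELECT amount FROM payments WHERE claim_id = :claim_id",
        "related": ["claim", "policy"],
    },
    "claim": {
        "synonyms": {"insurance claim", "loss event"},
        "sql": "SELECT * FROM claims WHERE id = :claim_id",
        "related": ["policy", "payout"],
    },
}

def resolve(term):
    """Map a user's word onto the canonical concept, via synonyms."""
    term = term.lower()
    for concept, meta in SEMANTIC_LAYER.items():
        if term == concept or term in meta["synonyms"]:
            return concept
    return None

concept = resolve("settlement")
print(concept, "->", SEMANTIC_LAYER[concept]["sql"])
```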

      Hybrid data indexing  

The third requirement of hybrid computing is that it must work on constantly changing, large data sets. Indexes are the fuel that powers all data-oriented computation. A hybrid data computing system must combine traditional structured data indexing, vector embeddings, and LLM data in one high-performance system.

      Vector embeddings are a type of data index that uses numerical representations to capture the essence of unstructured data like text, images, or audio. They also extract the semantic relationships that can be integrated with a semantic data layer. Machine learning models create vector embeddings to help make unstructured data searchable. 

      Vector indexing refers to creating specialized data structures to enable efficient similarity search and retrieval of vector embeddings in a vector database. Like traditional database indexing, vector indexing aims to speed up queries by organizing the vector data to allow fast nearest-neighbor searches. 
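As one concrete example, here is a sketch that builds an HNSW (Hierarchical Navigable Small World) index with the open-source hnswlib library; the dimensions, vectors, and parameters are illustrative.

```python
# Building and querying an approximate-nearest-neighbor index, sketched
# with the open-source hnswlib library. Parameters are illustrative.
import hnswlib
import numpy as np

dim, n = 384, 10_000
rng = np.random.default_rng(0)
vectors = rng.normal(size=(n, dim)).astype(np.float32)  # stand-in embeddings

index = hnswlib.Index(space="cosine", dim=dim)
index.init_index(max_elements=n, ef_construction=200, M=16)
index.add_items(vectors, np.arange(n))
index.set_ef(64)  # query-time accuracy/speed trade-off

labels, distances = index.knn_query(vectors[:1], k=5)
print(labels, distances)  # nearest neighbors of the first vector
```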

The elephant-in-the-room challenge associated with indexing unstructured data is that there’s so much of it. Financial services firms that analyze companies to make investment decisions must index thousands of pages of unstructured public data from SEC filings like 10-Ks, 10-Qs, 8-Ks, proxy statements, investor calls, and more.

The details of high-performance, scalable hybrid data indexing are outside the scope of this discussion. Still, it is the third foundational requirement of systems that process unstructured and structured data in one place. It is commonly done by “chunking” unstructured data into groups that are associated with similarly structured data and can be queried efficiently. Intelligent chunking uses vector embeddings to form a new type of hybrid index that combines unstructured and structured data in one optimized format.

Properly constructed hybrid computation, using hybrid indexing, can result in jaw-dropping economics. In one example using KDB.AI, a hybrid database, queries were found to be 100 times more efficient, and 0.001% as expensive per query, as non-optimized solutions. As a result, the solution was significantly more efficient, cost-effective, and easier to use.

      Accurate answers at scale 

      Hybrid data computing is an essential approach to the next wave of enterprise applications that use LLMs with carefully curated, structured data to produce more accurate, cost-effective answers at scale. The technologies to perform this kind of hybrid system are just now coming to market as vector databases mature and LLM use becomes more prevalent.  

      Learn how to integrate unstructured and structured data to build scalable GenAI applications with contextual search at our KDB.AI Learning Hub. 


      Overcoming AI Challenges with KDB.AI 1.1

      In 2023, KX launched KDB.AI, a groundbreaking vector database and search engine to empower developers to build the next generation of AI applications for high-speed, time-based, multi-modal workloads. Used in industries such as Financial Services, Telecommunications, Manufacturing and more, KDB.AI is today recognized as the world’s leading vector database solution for enterprise customers.

      In our latest update, we’re introducing several new features that will significantly improve vector performance, search reliability, and semantic relevance.

      Let’s explore.

      Hybrid Search

      The first is Hybrid Search, an advanced tool that merges the accuracy of keyword-focused sparse vector search with the contextual comprehension provided by semantic dense vector search.

Sparse vectors predominantly contain zero values. They are created by passing a document through a tokenizer and associating each word with a numerical token. The tokens, along with a tally of their occurrences, are then used to construct a sparse vector for that document. This is incredibly useful for information retrieval and natural language processing scenarios where specific keyword matching must be highly precise.

Dense vectors, in contrast, predominantly contain non-zero values and are used to encapsulate the semantic significance, relationships, and attributes present within the document. They are often used with deep learning models where the semantic meaning of words is important.

      With KDB.AI 1.1, analysts can tweak the relative importance of sparse and dense search results via an alpha parameter, ensuring highly pertinent data retrieval and efficient discovery of unparalleled insight.
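To show what such an alpha parameter does, here is a sketch of one common fusion rule, a convex combination of the two score sets. KDB.AI’s exact weighting may differ; the scores below are invented.

```python
# Blending sparse (keyword) and dense (semantic) scores with an alpha
# parameter. A simple convex combination is shown for illustration only.
import numpy as np

sparse = np.array([0.90, 0.10, 0.40])  # keyword-match scores per document
dense  = np.array([0.30, 0.85, 0.60])  # semantic-similarity scores

def hybrid_scores(alpha):
    """alpha=1.0 -> purely dense/semantic; alpha=0.0 -> purely sparse."""
    return alpha * dense + (1.0 - alpha) * sparse

for alpha in (0.0, 0.5, 1.0):
    ranking = np.argsort(hybrid_scores(alpha))[::-1]
    print(f"alpha={alpha}: best doc is #{ranking[0]}")
```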

      Example Use Case

      Consider a financial analyst looking for specific information on a company’s performance in order to assess investment risk. The analyst might search for “Company X’s Q3 earnings report” in which a sparse vector search would excel.

      However, the analyst might also be interested in the broader context, such as market trends, competitor performance, and economic indicators that could impact Company X’s performance. Dense vector search could be used to find documents that may not contain the exact keywords but are semantically related to the query.

      For example, it might find articles discussing a new product launched by a competitor or changes in trade policies affecting Company X’s industry.

      With Hybrid Search the analyst is afforded the best of both worlds, and ultimately retrieves a comprehensive set of information to assist with the development of their investment strategy.

      Temporal Similarity Search

      The second key feature is the introduction of Temporal Similarity Search (TSS), a comprehensive suite of tools for analyzing patterns, trends, and anomalies within time series datasets.

Comprising two key components – Transformed TSS, for highly efficient vector searches across massive time series datasets, and Non-Transformed TSS, for near real-time similarity search of fast-moving data – TSS enables developers to extract insights faster than ever before.

      Transformed Temporal Similarity Search

      Transformed Temporal Similarity Search is our patent-pending compression model designed to dimensionally reduce time-series windows by more than 99%. With Transformed TSS, KDB.AI can compress data points into significantly smaller dimensions whilst maintaining the integrity of the original data’s shape.

It also enables the compression of varying-sized windows into a uniform dimensionality, which is invaluable when working with time series data of different sample rates and window sizes.

      By doing so, Transformed TSS significantly reduces memory usage and disk space requirements to minimize computational burden. And with the ability to attach compressed embeddings to prebuilt Approximate Nearest Neighbor (ANN) indexes, developers can expect significant optimization of retrieval operations in large scale embeddings.
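KX’s compression model is patent-pending, so as a conceptual stand-in here is piecewise aggregate approximation (PAA), a classic technique that likewise reduces windows of different lengths to a small, fixed dimensionality while preserving shape. This is illustrative only, not KX’s algorithm.

```python
# Piecewise aggregate approximation (PAA): shrink windows of varying
# length to a fixed dimension while preserving their overall shape.
# (A conceptual stand-in; KX's actual compression model is patent-pending.)
import numpy as np

def paa(window, segments):
    """Mean of each of `segments` equal slices of the window."""
    idx = np.linspace(0, len(window), segments + 1).astype(int)
    return np.array([window[a:b].mean() for a, b in zip(idx[:-1], idx[1:])])

w120 = np.sin(np.linspace(0, 6, 120))     # 120-point window
w1000 = np.sin(np.linspace(0, 6, 1000))   # 1,000-point window, same shape

print(np.round(paa(w120, 8), 2))
print(np.round(paa(w1000, 8), 2))         # nearly identical 8-dim vectors
```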

      Example Use Case

      Consider a multinational retail corporation that has been experiencing stagnant growth and is now looking for ways to improve their business strategies.

      With Transformed TSS, they can perform detailed analysis of their time series user interaction data, including clicks, views, and engagement times. This allows them to uncover hidden patterns and trends, revealing optimal times and contexts for ad placement.

      Applying a similar concept to their retail operations, they can segment purchase history data into time windows, resulting in advanced similarity searches that unveil subtle purchase patterns, seasonal variations, and evolving consumer preferences.

      Armed with these insights, the corporation can fine-tune their marketing strategies, optimize stock levels, and predict future buying trends.

      Non-Transformed Temporal Similarity Search

      Non-Transformed Temporal Similarity Search is a revolutionary algorithm designed for conducting near real-time similarity search with extreme memory efficiency across fast moving time-series data. It provides a precise and efficient method to analyze patterns and trends with no need to embed, extract, or store vectors in the database.

Non-Transformed TSS enables direct similarity search on columnar time-series data without the need to define an Approximate Nearest Neighbor (ANN) search index. Tested on one million vectors, it achieved a memory footprint reduction of 99% and a 17x performance boost over 1,000 queries.

| Metric | Non-Transformed TSS | Hierarchical Navigable Small Worlds Index |
| --- | --- | --- |
| Memory footprint | 18.8MB | 2.4GB |
| Time to build index | 0s | 138s |
| Time for single similarity search | 23ms | 1ms (on prebuilt index) |
| Total time for single search (5 neighbors) | 23ms | 138s + 1ms |
| Total time for 1000 searches (5 neighbors) | 8s | 139s |
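To convey the idea of index-free similarity search (this is not KX’s actual algorithm), here is a brute-force sketch that slides a z-normalized query across a raw series and scores every window directly, with nothing embedded or stored in advance.

```python
# Index-free similarity search over a raw time series, in the spirit of
# Non-Transformed TSS: score every sliding window against the query.
# (Brute-force illustration; KX's algorithm is far more efficient.)
import numpy as np

def znorm(x):
    return (x - x.mean()) / (x.std() + 1e-9)

rng = np.random.default_rng(0)
series = rng.normal(size=20_000)        # e.g., one column of tick data
query = series[5_000:5_128]             # a pattern we want to find again

m, qz = len(query), znorm(query)
dists = np.array([np.linalg.norm(qz - znorm(series[i:i + m]))
                  for i in range(len(series) - m + 1)])
print(np.argsort(dists)[:5])            # index 5000 ranks first (distance 0)
```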

      Example Use Case

Consider a financial organization looking to enhance its fraud detection capabilities and better respond to the increased cadence and sophistication of attacks. With millions of customers and billions of transaction records, the organization requires a computationally efficient solution that will scale on demand.

      With Non-Transformed Temporal Similarity Search the organization can analyze transactions in near real-time, without the need to embed, extract or store incoming records into a database prior to analysis. Inbound transactions are compared against historical patterns in the same account, and those exhibiting a high degree of dissimilarity can be flagged for further investigation.

We hope that you are as excited as we are about the possibilities these enhancements bring to your AI toolkit. You can learn more by checking out our feature articles on the KDB.AI Learning Hub, then try them yourself by signing up for free at KDB.AI.

      The new dynamic data duo: Structured meets unstructured data to win on the generative AI playing field

On Wall Street, algorithmic trading has long been the revenue playing field: statistical analysis of micro-market movements helps traders predict how the profitability winds will blow at any millisecond. Profit, or “alpha,” is found by traders who create novel approaches to anticipate those price changes before their competitors do, by analyzing the movement and momentum of structured data: numbers representing market prices, trades, and volume.

      Today, the rise in technologies that make it cost-effective and easy to process unstructured data creates a new opportunity to gain an analytics edge: the combination of unstructured and structured data. This new source of insight is found in the connections between unstructured and structured data.

      A new dynamic data duo, if you will.

      Let’s explore this new insight opportunity and how firms can capitalize on it.

      Structured data, meet unstructured data

Historically, unstructured data – found in PDF documents, on the web, or in images, video, and audio – has been explored but unexploited. Today, with the rise of generative AI and LLM technology, analyzing unstructured data creates new opportunities for insight.

      In financial services, fusing structured market data with unstructured data like SEC filings, client interactions, analyst reports, social media sentiment, news, and more can reveal a new depth to insights. Combining structured and unstructured data is a revolutionary way to unlock data in ways never done before.

Unstructured data alone provides good, general advice about why an investment portfolio might decline in value, citing market volatility, news, currency fluctuations, and interest rate changes as reasons your portfolio might underperform.

But add individualized data from structured data sources, including account numbers, specific investments, their temporal performance, and indexes, and we get true insight. We see why my portfolio declined. We see that my portfolio is outperforming its index. We see why my portfolio performed as it did. We see unique generative AI insight.

      This dynamic duo of unstructured and structured data leverages three new computing elements.

Real-time unstructured data. Much of today’s structured data, like vital signs emitted from medical devices, is readily available. However, unstructured data for business applications, such as digital versions of doctors’ notes, is not as prevalent. But thanks to advances in analyzing conversational data, such capabilities are becoming ubiquitous and cost-effective.

Digitized unstructured data. Thanks to the rise of generative AI, conversational messaging and intelligent document processing technologies are more prevalent, less expensive, and easier to use than ever before. One example is conference call transcription and summarization, both available in tools like Zoom and Otter.ai. These tools emit a new source of digitized unstructured data useful for analysis.

      Databases that fuse unstructured and structured data. Generative AI applications also require data management systems to connect and combine unstructured with structured data via vector embeddings, synthetic data sources, and data warehouses full of fused data to help prepare data for analysis. For example, KX’s new KDB.AI offering is designed to generate vector embeddings on unstructured documents and make them available for real-time queries.

      The new dynamic duo and the role of LLMs 

This dynamic data duo is not only at work on Wall Street; it’s also being used in Main Street applications. Consider healthcare. When you visit a hospital, doctors talk to you. That conversation generates unstructured data, with clues hidden inside your responses to their questions. Frontline staff also take your vital signs, which provide a numerical read on how your body is actually performing.

The art of medicine is a doctor’s ability to connect numerical facts with clues revealed in conversations about how you feel. The National Institute of Health in Singapore implements this system today. Its system, Endeavor, combines conversations with vital signs in real time to produce predictive, proactive insights.

For example, a machine learning algorithm evaluates unstructured doctors’ notes to identify references to abdominal pain reported by a patient.

      Structured data comes from medical devices that monitor patient vital signs, and unstructured data comes from digital versions of doctor notes, patient utterances, medical journals, and research, providing a 360-degree view of insight to help improve care.

      This unstructured and structured data is sent in real time to AI algorithms that silently search for and predict the likelihood of dozens of potential ailments and diseases, including eye disease, cardiac abnormalities, pulmonary disease, neurological disorders, septic shock, and oncology.

      Predictions are returned to front-line medical staff who can make smarter recommendations. In this case, AI predicts that this patient is 95% likely to have appendicitis.

      A new dynamic duo, a new source of insights

      Traditionally, the two “data worlds” of unstructured and structured data did not collide. But today, unstructured data is easier and more cost-effective to extract than ever before, which makes it possible for the first time to easily combine with structured data to generate new insights.

      This new dynamic duo of data affords new opportunities for insight hidden between conversational data and real-time streaming data, from Wall Street to Main Street. Databases designed to combine structured and unstructured data to unlock new hidden insights are the key arbiters of these data insights.

      Build RAG-Enabled Applications with LlamaIndex and KDB.AI

      Large Language Models (LLMs) have transformed natural language understanding, powering applications like chatbots, question answering, and summarization. However, their static datasets can limit relevance and accuracy. Retrieval-Augmented Generation (RAG) addresses this by enriching LLMs with up-to-date external data, enhancing response quality and contextual relevance. RAG is a powerful workflow, but building RAG-enabled applications is complex, requiring multiple steps and a scalable infrastructure.

      To simplify this process, we’re excited to introduce the integration of KDB.AI with LlamaIndex, an open-source framework that streamlines the ingestion, storage, and retrieval of datasets for RAG applications. This integration enables developers to create sophisticated RAG-enabled applications with ease and efficiency.

      In this blog post, we will explain how LlamaIndex and KDB.AI work together to enable RAG solutions and showcase some potential enterprise use cases that can benefit from this integration.

      How Does LlamaIndex Enable RAG Solutions?

LlamaIndex is a data framework for building LLM-based applications; it specializes in augmenting LLMs with private or domain-specific data. LlamaIndex offers several types of tools and integrations to help users quickly develop and optimize RAG pipelines:

      • Data Loaders: Ingest your data from its native format. There are many connectors available including for .csv, .docx, HTML, .txt, PDF, PPTX, Pandas DataFrames, and more.
      • Parsing: Chunking data into smaller and more context specific nodes can greatly improve the results of your RAG application.
      • Embeddings: Transforming your data into vector embeddings is a key step in the RAG process. LlamaIndex integrates with many embedding models including OpenAI embedding models, Hugging Face Embeddings, LangChain Embeddings, Gemini Embeddings, Clip Embeddings, and many more.
      • Vector Stores / Index: Store embeddings within vector databases like KDB.AI to perform fast and accurate retrieval of relevant data to augment the LLM.
      • Hyperparameter Tuning: Optimize both chunk size and the number of top-k retrieved chunks to ensure your RAG pipeline generates the best possible results.
• Retrievers: LlamaIndex offers a variety of retrievers to get the most relevant data from the index. Some examples are Auto-Retrieval, the Knowledge Graph retriever, the hybrid retriever (BM25), the Reciprocal Rerank Fusion retriever, the Recursive Retriever, and the Ensemble Retriever.
      • Postprocessors: LlamaIndex has many options for postprocessing retrieved data ranging from keyword matching, reranking, recency filtering, time-weighted reranking, sentence windows, long context reordering (fixes lost in the middle problem), prompt compression, retrieve surrounding nodes and others.
      • Data Agents: Agents are LLM-powered knowledge workers that use tools and functions to complete specific tasks. LlamaIndex supports and integrates with several agent frameworks such as “OpenAIAgent”.
      • Evaluation: Evaluate both the retrieval and generation phases of RAG with modules to test retrieval precision, augmentation precision, answer consistency, answer accuracy and more.
      • Llama Packs: Llama Packs are prepackaged modules to help users quickly compose an LLM application. Llama Packs can be initialized and run out-of-the-box or used as templates to modify to your use-case. You can see available Llama Packs on the Llama Hub. Examples include RAG pipelines, resume screener, and moderation packages.

      KDB.AI Integration with LlamaIndex

      KDB.AI is a high-performance vector database optimized for machine learning, natural language processing, and semantic search at scale. It stores and queries vector embeddings, with the ability to attach embeddings to a variety of indexes to facilitate rapid vector search and retrieval.

LlamaIndex can be used to orchestrate ingestion, preprocessing, metadata tagging, and embedding for incoming data or the user’s query. Its integration with KDB.AI enables a variety of retrieval methods to find contextually relevant information from the KDB.AI vector store. The retrieved information can then be postprocessed and used to augment the LLM’s generated output, resulting in a more precise and contextually relevant response to the user’s question.

LlamaIndex provides functionality to help orchestrate each phase of this workflow – ingestion, parsing, embedding, storage, retrieval, and generation – while still giving the user the flexibility to implement the RAG workflow in the best interest of the use case. A hedged sketch of the flow appears below.
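This sketch assumes the llama-index-vector-stores-kdbai integration package and the kdbai-client library; the endpoint, API key, table name, and data directory are placeholders, the table-creation step (which varies by client version) is omitted, and an embedding model/LLM is assumed to be configured (LlamaIndex defaults to OpenAI).

```python
# Hedged sketch of the LlamaIndex + KDB.AI RAG flow. Names and
# credentials are placeholders; see the KDB.AI docs for table setup.
import kdbai_client as kdbai
from llama_index.core import SimpleDirectoryReader, StorageContext, VectorStoreIndex
from llama_index.vector_stores.kdbai import KDBAIVectorStore

session = kdbai.Session(endpoint="YOUR_ENDPOINT", api_key="YOUR_API_KEY")
table = session.table("documents")  # pre-created table with an embedding column

documents = SimpleDirectoryReader("./reports").load_data()  # 1. ingest
vector_store = KDBAIVectorStore(table)                      # 2. vector store
storage = StorageContext.from_defaults(vector_store=vector_store)
index = VectorStoreIndex.from_documents(documents, storage_context=storage)

query_engine = index.as_query_engine(similarity_top_k=3)    # 3. retrieve
print(query_engine.query("What were the key risks cited in Q3?"))  # 4. generate
```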

      Potential Use Cases

      By combining LlamaIndex and KDB.AI, developers can leverage the power of RAG solutions for a variety of applications, such as:

      • Document Q&A: You can use LlamaIndex to ingest and index your unstructured data sources, such as manuals, reports, contracts, etc., and convert to vector embeddings. Then, you can use KDB.AI to store and query the vector embeddings at scale, using natural language queries. This way, you can provide fast and accurate answers to your users’ questions, without requiring them to read through lengthy documents.
      • Data Augmented Chatbots: You can use LlamaIndex to connect and structure your semi-structured data sources, such as APIs, databases, etc. Then, you can use KDB.AI to search and rank the relevant data items based on the user’s input and the chatbot’s context. This way, you can enhance your chatbot’s capabilities and provide more personalized and engaging conversations to your users.
      • Knowledge Agents: You can use LlamaIndex to index your knowledge base and tasks, such as FAQs, workflows, procedures, etc. Then, you can use KDB.AI to store and query the vector embeddings, using natural language commands. This way, you can create automated decision machines that can perform tasks based on the user’s input, such as booking appointments, ordering products, resolving issues, etc.
      • Structured Analytics: You can use LlamaIndex to ingest and index your structured data sources, such as spreadsheets, tables, charts, etc. Then, you can use KDB.AI to search and rank the relevant data rows or columns based on the user’s natural language query. This way, you can provide easy and intuitive access to your data analytics, without requiring the user to learn complex syntax or tools.
      • Content Generation: You can use LlamaIndex to ingest and index your existing content sources, such as blogs, articles, books, etc. Then, you can use KDB.AI to search and rank the most similar or relevant content items based on the user’s input or topic. This way, you can generate new and original content, such as summaries, headlines, captions, etc., using the LLM’s generation capabilities.

In this blog, we have discussed how LlamaIndex and KDB.AI work together to empower developers to build RAG-enabled applications quickly and at scale. By integrating LlamaIndex and KDB.AI, developers can augment LLMs with contextually accurate information and, in turn, provide more precise and contextually relevant responses to end users’ questions.

      To find out more check out our documentation today!

      Related Resources

      Harnessing the Power of Generative AI for Customer Success

      Harnessing the Power of Generative AI for Customer Success

      11 Insights to Help Quants Break Through Data and Analytics Barriers

      Book a Demo

      Transforming Enterprise AI with KDB.AI on LangChain

      Artificial Intelligence (AI) is transforming every industry and sector, from healthcare to finance, from manufacturing to retail. However, not all AI solutions are created equal. Many of them suffer from limitations such as poor scalability, low accuracy, high latency, and lack of explainability.

      That’s why we’re excited to announce the integration of KDB.AI and LangChain, two cutting-edge technologies designed to overcome these challenges and deliver unparalleled capabilities for enterprise AI via a simple and intuitive architecture that doesn’t require complex infrastructure or costly expertise.

      In this blog post, I’ll give you a brief overview of each technology, discuss typical use cases, and then show you how to get started. Let’s begin.

      What is KDB.AI?

KDB.AI is an enterprise-grade vector database and analytics platform that enables real-time processing of both structured and unstructured time-oriented data. It’s based on kdb+, the world’s fastest time-series database, which is widely used by leading financial institutions for high-frequency trading and market data analysis.

      With KDB.AI, developers can seamlessly scale from billions to trillions of vectors without performance degradation, thanks to its distributed architecture and efficient compression algorithms. It also supports various data formats, such as text, images, audio, video, and more.

With KDB.AI you can do the following (a minimal code sketch follows the list):

      • Create an index of vectors (Flat, IVF, IVFPQ, or HNSW).
      • Append vectors to an index.
      • Perform fast vector similarity search with optional metadata filtering.
      • Persist an index to disk.
      • Load an index from disk.
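Here is a minimal sketch of that workflow using the kdbai_client Python package. The endpoint, table name, schema fields, and vector dimensions are illustrative, and exact signatures vary between client versions, so check the KDB.AI docs before running it.

# Minimal KDB.AI workflow sketch: create an index, append vectors, search.
# Assumes: pip install kdbai_client numpy pandas, and a local KDB.AI server.
import numpy as np
import pandas as pd
import kdbai_client as kdbai

session = kdbai.Session(endpoint="http://localhost:8082")

# Create a table backed by a Flat vector index (IVF, IVFPQ, or HNSW
# would be configured the same way via the "type" field).
schema = {"columns": [
    {"name": "id", "pytype": "str"},
    {"name": "vectors", "vectorIndex": {"dims": 8, "metric": "L2", "type": "flat"}},
]}
table = session.create_table("demo", schema)

# Append vectors to the index.
table.insert(pd.DataFrame({
    "id": ["a", "b", "c"],
    "vectors": list(np.random.rand(3, 8).astype("float32")),
}))

# Fast similarity search: the two nearest neighbours of a query vector.
print(table.search(vectors=[np.random.rand(8).astype("float32").tolist()], n=2))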

      To learn more about KDB.AI, visit our documentation site.


      What is LangChain?

LangChain is an open-source framework designed to simplify the creation of applications powered by language models. At its core, LangChain enables you to “chain” together components that act as the building blocks for natural language applications such as chatbots, virtual agents, and document summarization.

LangChain doesn’t rely on traditional NLP pipelines such as tokenization, lemmatization, or dependency parsing. Instead, it uses vector representations of natural language, such as word, sentence, or document embeddings, which capture the semantic and syntactic information of natural language in a compact and universal way.
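As a quick illustration of that embedding-centric approach, the sketch below turns texts into vectors whose distances reflect semantic similarity. It assumes the langchain-openai package and an OPENAI_API_KEY in the environment; any LangChain embedding model would work the same way.

# Embedding texts with LangChain: semantically similar texts end up
# with nearby vectors. Assumes: pip install langchain-openai and an
# OPENAI_API_KEY in the environment (the default model is used here).
from langchain_openai import OpenAIEmbeddings

embeddings = OpenAIEmbeddings()

docs = ["kdb+ is a high-performance time-series database.",
        "It was sunny at the beach this weekend."]
doc_vectors = embeddings.embed_documents(docs)
query_vector = embeddings.embed_query("Which database handles time-series data?")

# Each vector is a plain list of floats, ready to be stored and
# searched in a vector database such as KDB.AI.
print(len(doc_vectors), len(query_vector))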

      To learn more about LangChain, visit their documentation site.


      How KDB.AI and LangChain work together

The integration of KDB.AI and LangChain empowers developers with real-time vector processing capability and state-of-the-art NLP models. This combination opens up new possibilities and use cases for enterprise AI (a minimal RAG sketch follows the list), such as:

      • Enterprise search: You can use LangChain to encode text documents into vectors, and then use KDB.AI to index and query them using advanced quad-search capabilities, combining keyword, fuzzy, semantic, and time-based search. This way, you can create a powerful and flexible enterprise search capability that can handle any type of query and return the most relevant results.
      • RAG at scale: You can use LangChain to implement Retrieval Augmented Generation (RAG), a novel technique that combines a retriever and generator to produce rich and diverse text outputs. You can then use KDB.AI to store and retrieve the vectors of the documents that are used by the retriever, enabling you to scale RAG to large and complex domains and applications.
      • Anomaly detection: You can use LangChain to detect anomalies in text data, such as spam, fraud, or cyberattacks, using pre-trained or fine-tuned models. You can then use KDB.AI to store and analyze the vectors of the anomalous texts, using clustering, classification, or regression techniques, to identify the root causes and patterns.
      • Sentiment Analysis: You can use LangChain to perform sentiment analysis on text data, such as customer reviews, social media posts, or news articles, using pre-trained or fine-tuned models. You can then use KDB.AI to store and visualize the vectors of the texts, using dashboarding, charting, or reporting tools, to gain insights into the opinions and emotions of customers, users, or audiences.
      • Text summarization: You can use LangChain to generate concise and informative summaries of long text documents, such as reports, articles, or books, using pre-trained or fine-tuned models. You can then use KDB.AI to store and compare the vectors of the original and summarized texts, using similarity or distance metrics, to evaluate the quality and accuracy of the summaries.
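To ground the RAG-at-scale use case, here is a minimal sketch that stores embeddings in KDB.AI through LangChain’s KDBAI vector store and retrieves them to augment an LLM answer. It assumes the langchain-community and langchain-openai packages, a pre-created KDB.AI table, and an OpenAI key; all names are illustrative.

# Minimal RAG sketch with LangChain + KDB.AI.
# Assumes: pip install langchain-community langchain-openai kdbai_client,
# a local KDB.AI server, and a pre-created table (names are illustrative).
import kdbai_client as kdbai
from langchain_community.vectorstores import KDBAI
from langchain_openai import ChatOpenAI, OpenAIEmbeddings

session = kdbai.Session(endpoint="http://localhost:8082")
table = session.table("documents")  # hypothetical pre-created table

# Store document embeddings in KDB.AI and expose them as a retriever.
vectorstore = KDBAI(table, OpenAIEmbeddings())
vectorstore.add_texts(["KDB.AI scales similarity search from billions to trillions of vectors."])
retriever = vectorstore.as_retriever(search_kwargs={"k": 3})

# Retrieve context for the question and let the LLM generate a grounded answer.
question = "How does KDB.AI scale?"
context = "\n".join(d.page_content for d in retriever.invoke(question))
print(ChatOpenAI().invoke(f"Context:\n{context}\n\nQuestion: {question}"))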


      How to get started with KDB.AI and LangChain

      If you’re interested in trying out KDB.AI on LangChain, I invite you to follow these simple steps.

      1. Sign up for a free trial of KDB.AI.
      2. Set up your environment and configure pre-requisites.
      3. Work through the sample integration.

We also have some great resources from our evangelism team, including samples over on the KDB.AI learning hub and regular livestreams. And should you have any feedback, questions, or issues, there’s a dedicated team over on our Slack community.

      Happy Coding!


      The Montauk Diaries – Two Stars Collide

      by Steve Wilcockson


      Two Stars Collide: Thursday at KX CON [23]


My favorite line, one that drew audible gasps on the opening day of the packed KX CON [23]:

“I don’t work in q, but beautiful, beautiful Python,” said Erin Stanton of Virtu Financial, simply and eloquently. As the q devotees in the audience chuckled, she qualified her statement further: “I’m a data scientist. I love Python.”

The q devotees had their moment later, however, when Pierre Kovalev, a developer on the KX Core Team, showed not PowerPoint but 14 rounds of q, interactively swapping characters in his code on the fly to demonstrate key language concepts. The audience lapped up the q show; it was brilliant.

Before I return to how the Python and kdb/q stars collide, I’ll note that there were many announcements during the day; they’re covered elsewhere, and I may return to them in a later blog.

Also, Kevin Webster of Columbia University and Imperial College highlighted the critical role of kdb in price impact work. He referenced many of my favorite price impact academics, several hailing from the great Capital Fund Management (CFM).

      Yet the compelling theme throughout Thursday at KX CON [23] was the remarkable blend of the dedicated, hyper-efficient kdb/q and data science creativity offered up by Python.

      Erin’s Story

For me, Erin Stanton’s story was absolutely compelling. A few years back, her team at broker Virtu Financial had converted what seemed to be largely static, formulaic SQL applications into meaningful research applications. The new generation of apps was built with Python, with kdb behind the scenes serving up clean, consistent data efficiently and quickly.

      “For me as a data scientist, a Python app was like Xmas morning. But the secret sauce was kdb underneath. I want clean data for my Python, and I did not have that problem any more. One example, I had a SQL report that took 8 hours. It takes 5 minutes in Python and kdb.”

The Virtu story shows Python/kdb interoperability at work. Python allows them to express analytics, most notably machine learning models (random forests had more mentions in 30 minutes than I’ve heard in a year working at KX, which was an utter delight! I’ve missed them). Her team could apply their models to data sets amounting to 75k orders a day, in one case 6 million orders over a four-month period, an unusual time horizon but one which covered differing market volatilities for training and key feature extraction. They could specify different, shorter time horizons and apply different decision metrics. “I never have problems pulling the data.” The result: feature engineering for machine learning models that drives better prediction and greater client value. With this, Virtu Financial has been able to “provide machine learning as a service to the buyside… We give them a feature engineering model set relevant to their situation!”, driven by Python, with data served up by kdb.

      The Highest Frequency Hedge Fund Story

I won’t name the second speaker, but let’s just say they’re leaders on the high-tech algorithmic buy-side. They want Python to exhibit q-level performance so that their technical teams can use Python-based utilities that deliver real-time event processing and a wealth of analytics. For them, 80 to 100 nodes could process a breathtaking trillion-plus events per day, serviced by a sizeable set of Python-led computational engines.

Overcoming the perceived hurdle of the expressive yet challenging q language at the hedge fund, PyKX bridges Python to the power of kdb/q. Their traders, quant researchers, and software engineers could embed kdb+ capabilities to deliver very acceptable performance for the majority of their (interconnected, graph-node implemented) Python-led use cases. With no need for C++ plug-ins, Python controls the program flow. Behind the scenes, the conversion between NumPy, pandas, Arrow, and kdb objects is abstracted away.
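For a feel of what that bridging looks like in practice, here is a minimal PyKX sketch: run q from Python, then move data between q and pandas. It assumes pip install pykx (unlicensed mode supports a subset of features); the table contents are illustrative and not drawn from the talk.

# Minimal PyKX sketch: run q from Python and convert between q and pandas.
import pykx as kx

# Build a small random trades table directly in q.
trades = kx.q('([] time:5?.z.t; sym:5?`AAPL`MSFT; price:5?100f)')

# Convert the q table to pandas for Python-side analytics...
df = trades.pd()
print(df.describe())

# ...and push a Python object back into the q workspace for q-side crunching.
kx.q['t'] = df
print(kx.q('select avgPrice:avg price by sym from t'))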

      This is a really powerful use case from a leader in its field, showing how kdb can be embedded directly into Python applications for real-time, ultra-fast analytics and processing.

      Alex’s Story

Alex Donohoe of TD Securities took another angle for his exploration of Python & kdb. For one thing, he worked primarily with over-the-counter products (FX and fixed income), which meant “very dirty data compared to equities.” However, the primary impact was to explore how Python and kdb could drive successful collaboration across his teams, from data scientists and engineers to domain experts, sales teams, and IT teams.

Alex’s personal story was fascinating. As a physics graduate, he’d reluctantly picked up kdb in a former life: “can’t I just take this data and stick it somewhere else, e.g., MATLAB?”

      He stuck with kdb.

“I grew to love it, the cleanliness of the [q] language,” he said. “Very elegant for joins.” On joining TD, he was forced to go without it and worked with pandas, but he built his ecosystem in such a way that he could integrate with kdb at a later date, which he and his team indeed did. His journey had therefore gone from “not really liking kdb very much at all to really enjoying it, to missing it,” appreciating its ability to handle difficult maths efficiently; for example, “you do need a lot of compute to look at flow toxicity.” He learnt that Python could offer interesting signals out of the box, including non-high-frequency signals, and was great for plumbing, yet kdb remained unsurpassed for its number crunching.

      Having finally introduced kdb to TD, he’s careful to promote it well and wisely. “I want more kdb so I choose to reduce the barriers to entry.” His teams mostly start with Python, but they move into kdb as the problems hit the kdb sweet spot.

On his kdb and Python journey, he noted some interesting, perhaps surprising, findings. “Python data explorers are not good. I can’t see timestamps. I have to copy & paste to Excel, painfully. Frictions add up quickly.” He felt “kdb data inspection was much better.” From a Java perspective too, he looks forward to mimicking the development capabilities of Java when able to use kdb in VS Code.

Overall, he loved that data engineers, quants, and electronic traders could leverage Python while drawing on his kdb developers for further support. Downstream risk, compliance, and sales teams could also derive meaningful insights more easily and quickly, which is particularly important as they become more data-aware and want to serve themselves.

      Thursday at KX CON [23]

The first day of KX CON [23] was brilliant: a great swathe of announcements and superb presentations. For me, the highlight was the different stories of how, when the Python and kdb stars align, magic happens, while the q devotees saw some brilliant q code.