Machine learning: Using embedPy to apply LASSO regression

23 Oct 2018 | , , ,
Share on:

By Samantha Gallagher

 

The use of kdb+ for machine learning in financial technology and other industries is expanding following the release by Kx of the powerful embedPy interface, which allows the kdb+ interpreter to manipulate Python objects, call Python functions, and load Python libraries. Now Python and kdb+ developers can fuse both technologies together, allowing for a seamless application of q’s high-speed analytics and Python’s expansive collection of libraries.

In our latest technical white paper, Kx engineer Samantha Gallagher introduces embedPy, covering both a range of basic tutorials as well as a comprehensive solution to a machine-learning project. EmbedPy is available on GitHub to use with kdb+ V3.5+ and Python 3.5 or higher, for macOS or Linux operating systems and Python 3.6 or higher on the Windows operating system. The installation directory also contains a README.txt about embedPy, and an example directory containing thorough examples.

You can read Samantha’s paper on the Kx Developer’s site, code.kx.com here.

SUGGESTED ARTICLES

kdb+ and NAG library with PyQ

Using the NAG Library for Python with kdb+ and PyQ

25 Jun 2019 | , ,

This paper provides detailed instructions on how to use the Numerical Algorithms Group (NAG) Library for Python with kdb+ and PyQ. The NAG Library contains more than 1,800 mathematical and statistical routines, and is accessible by numerous programming languages (including Python, C++, Java, Fortran, etc.).

A comparison of Python and q for data problem solving

8 May 2019 |

This article takes a simple, real-life problem and analyzes different solutions in Python and q. The problem leads us to discover nice areas of both programming languages, including vector operations, Einstein summation, adverbs and functional form of select statements. Each solution has lessons that deepen our IT knowledge, especially when we consider performance.

kx and machine learning

Machine Learning Toolkit Update: Multi-parameter FRESH and updated utilities

25 Apr 2019 | , ,

This latest toolkit release, is the first in a series of planned releases in 2019 that will add updates to the functionality of the FRESH (Feature Extraction based on Scalable Hypothesis tests) algorithm and the addition of a number of accuracy metrics, preprocessing functions and utilities. In conjunction with code changes, modifications to the namespace structure of the toolkit have been made to streamline the code and improve user experience.