Data ProcessingWorkshop

Learn high performance data processing with Clojure dataset. Learn how to work with columnar data and how to get most performance when processing datasets of realistic sizes which in our case means millions of rows and tens of columns.

Get notified when we're open for registration.

By continuing you agree to terms and conditions and privacy policy.

Your instructor

Chris attended the University of Colorado at Boulder earning a BS in computer science. Since then, his technical excellence has driven the development of many industry-leading products. Startups love him: with Anark, he brought advanced computer graphics and designer-developer interactions to the web. At SRC he deployed new lossless compression algorithms for GIS systems. At NVIDIA, he led an international team to develop the powerful and popular PhysX debugger. And, in recent years he has focused on applying neural networks and other data-intensive techniques to stubborn business problems, leaving a trail of happy customers in his wake. He has the distinction of both releasing software on CD-ROM, and managing cloud-native machine learning systems with high throughput and availability. Outside of work Chris enjoys his family, rock climbing, and pushing computers to perform beyond their limits.

Chris Nuernberger
Chris Nuernberger
Managing Partner, TechAscent

Workshop details

What will your learn?

Learn high performance data processing with Clojure dataset. Learn how to work with columnar data and how to get most performance when processing datasets of realistic sizes which in our case means millions of rows and tens of columns.

High level design

High level overview of tech.ml.dataset

Dataset Creation

Learn how to work with different data formats.

Printing

Learn how to work with different printing options to suit your workflow.

Dataset manipulation

Get to know how to manipulate dataset and work with them.

Access to column values

Learn how to access column values and understand what advantage they give us.

Subrect Selection

Understand two methods to select subrects of information from a dataset.

Operations on datasets

Add, Remove, Update, Sort-by, Filter, Group-by, rowwise operations.

Element-wise operations

Learn how to work with element-wise map - emap

DateTime Types

Learn how to read and manipulate DataTime Types.

Writing a dataset out

Write data to different formats.

What should you know?

This workshop is for developers who are familiar with Clojure on the JVM. This workshop will introduce you to high performance computing and how to use tech.ml.dataset library.

When does it take place?

2023

Frequently asked questions

When does the workshop start?
We are working hard to bring you the best possible workshop. Sing up to get notified when we are ready for registration.
Can I get reimbursed?
We hope so! We've had students in the past ask their boss or HR department and get reimbursement for the ClojureStream Courses and we believe the same thing can work here. More and more organizations are encouraging their teams to contribute by learning.
What is the refund policy?
If you are no longer able to participate in the workshop or feel it's not a fit, you can withdraw your enrollment in exchange for a full refund provided you make your request no later than one 14 days before workshop start. No withdrawals or refunds are possible after this date. No deferrals or transfers are possible at any time. If we won't be able to deliver the workshop after purchase we'll do a full refund to all participants.
Is special software or technology required?
The workshop works best if you have a keyboard, so we recommend laptops instead of mobile devices. You will also have to install Zoom, a free to install tool that's used in the workshop.