Class Central is learner-supported. When you buy through links on our site, we may earn an affiliate commission.

YouTube

Immutable Data Science with Datomic, Spark and Kafka

Strange Loop Conference via YouTube

Overview

Coursera Plus Monthly Sale: All Certificates & Courses 40% Off!
Explore an innovative approach to data science architecture in this conference talk that leverages Datomic, Spark, and Kafka for scalable real-time analysis of production data without traditional ETL techniques. Discover how immutability, consistent timelines, and multi-database querying enable machine learning models with full traceability in a microservices architecture. Learn about modern stored procedures, pass-by-reference queries, horizontal read scalability, and an immutable messaging substrate. Gain insights into an alternative to lambda and kappa architectures, addressing sensitive data encryption and information security concerns. Understand how this solution eliminates the need for ETL and database synchronization pipelines while maintaining scalability and isolation for both transactional and analytical use cases.

Syllabus

Intro
Microservices
Board
How is it stored?
How is it queried?
How do we get ?
Enriched entity
Entity from cursor and id
Multiple DBS
No interference
Using it
Sample message
Sample query
Model service
Output
Scoring time
Training time
RDDs: our use case
Sharding queries
Data access
Learning curve
Testimonials

Taught by

Strange Loop Conference

Reviews

Start your review of Immutable Data Science with Datomic, Spark and Kafka

Never Stop Learning.

Get personalized course recommendations, track subjects and courses with reminders, and more.

Someone learning on their laptop while sitting on the floor.