PyCon Israel 2022

🇺🇸 Life, Death, and Shopping
2022-06-29, 14:00–14:50, PyData

A step-by-step introduction to purchase prediction. Also applicable to survival analysis and churn prediction. Including implementation in PySpark.


When dealing with survival analysis, the model's success is predicting death correctly. But it can also predict an engine failure, abandonment, or even purchases.
In purchase prediction, survival analysis, or churn prediction, the data is usually labeled or artificially labeled by a set of rules- such as inactivity for 30 days equivalent to churn. But the data structure is different from classical machine learning, and the data handling and modeling are different accordingly.
In this lecture, we will cover the data structures and aggregations for such analysis focusing on time aggregations using pyspark and what NLP got to do with any of it.


Session language – English Target audience – Data Scientists Other (target audience) – In this lecture there is something for everyone, beginners wishing to learn what to learn, advanced that need a reminder and maybe introduction to a new topic, R&D for new concepts, and in between.