Hello! I am Cesar Conejo Villalobos. I have a major in Actuarial Sciences from the University of Costa Rica and a master’s degree in Statistics for Data Science at Carlos III University of Madrid, Spain. Additionally, I have five and a half years of experience in the bank and finance industry. Along this path, I have learned and applied different back-end and front-end data science techniques, especially in data storage infrastructure, data analysis, and machine learning algorithms focused on fraud detection, financial customer segmentation and attrition using SQL and R.
I am also a history and philosophy lover, a passion I try to mix with data science. The most helpful thing that I have learned from history is that it is not much use to make predictions and calculations in the medium and long term. The world always goes on its own business! In the end, things and situations that are unpredictable or very difficult to occur can appear, resulting in the analysis being out of date. So, What can we do in this world of chaos? In my case, I consider that the best thing we can do is quantify the uncertainty with probability. =)
M.S. Statistics for Data Science, 2021
Carlos III University of Madrid
B.S. in Actuarial Sciences, 2014
University of Costa Rica
I worked on three fronts:
Technical leadership: I provided mentorship and technical support for the data science team tasked with detecting the fraud patterns in credit/debit cards and acquiring business. I also reviewed experiments and assisted in the predictions models for other areas of the bank as the Compliance and Credit departments.
Information quality: I conducted activities related to ensuring data quality and consistency. I designed interactive dashboards for showing the evolution of the main KPIs and proactively challenge the presence of anomalies and deviations. Dashboards were used for data-driven decisions of the Business team in order to explain historical and current events related to fraud tendencies and chargebacks.
Technological solutions: I was responsible for the migration from structured manual business procedures to automatical systems, especially in the Chargebacks department.
Responsibilities included:
Acquirer commerce review: I created SQL queries and R scripts to detect anomalies and deviations of the daily transactions associated with the distinct business. I used several techniques for detecting fraud cases, as Outlier detection, Association rules, Time series, and other non-parametric approaches.
Actuarial Analysis: I applied actuarial validation techniques for the pre-feasibility of new insurance policies for acquirer commerce. Also, I estimated the required technical reserves for this new product based on three different actuarial techniques: Chain Ladder, Buhlmann, and Buhlman-Straub. Also, I employed survival analysis in order to determine the life expectancy of the credit/debit cards produced by the bank.
Site with the official pdf document and Python code of my thesis project for the master’s degree in Statistics for Data Science. My thesis applies Convolutional Neural Networks (CNNs), a popular Deep Learning framework used for detecting patterns, especially in images and photos. But, instead of analyzing pictures, I am applying the CNNs in the context of statistical inference on high-dimensional data and Multiple Hypothesis testing.
Nonparametric methods such as Kaplan-Meier and Nelson-Aalen is applied to the Hard Drive Data of Backblaze.
Identifying and predicting anomalies in time series is crucial for decision making. So, we are going to use an option in R for doing the work.
In this project, we review two methods of unsupervised techniques: k-means and hierarchical clustering.
An introduction to supervised machine learning algorithms, especially methods for classification problems.