Browsing: Data Science
Clustering: A simple way to group similar rows and prevent unnecessary data processingIn my previous article, I explained how to optimise SQL queries using partitioning:Now, I’m…
Data Visualization, Data StorytellingSimplify your overwhelmed charts by using slope charts: a tutorial in Python AltairImage by AuthorWe may plot charts to include as many concepts…
During a road trip we discussed how this tool was interesting but not necessarily useful. My friend could plug in numbers and see what happened, but…
A step-by-step walkthrough of inter-participant and intra-participant classification performed on wearable sensor data of runnersImage by authorRunning data collected using wearable sensors can provide insights about…
How to improve the performance of your Retrieval-Augmented Generation (RAG) pipeline with these “hyperparameters” and tuning strategiesTuning Strategies for Retrieval-Augmented Generation ApplicationsData Science is an experimental…
How cloud computing and analytics engineering forced the transition from ETL to ELTImage generated via DALL-EETL (Extract-Transform-Load) and ELT (Extract-Load-Transform) are two terms commonly used in…
Experimenting with Large Language Models for freeArtistic representation of the LangChain, Photo by Ruan Richard Rodrigues, UnsplashEverybody knows that large language models are, by definition, large.…
In 3 words: timeliness, methodology, and digestibilityA couple of weeks ago, I wrote about building systems to generate more quality insights. I presented how you could…
In this article, I explore the public transport systems of four selected cities relying on General Transit Feed Specification and various tools of spatial data science.I…
With A Tail of Cat Food PreferencesPhoto by Anastasiia Rozumna on UnsplashWelcome to the ‘Courage to learn ML’. This series aims to simplify complex machine learning…