Posts Tagged "python"

Mistral 7B on consumer hardware

Run Mistral 7B locally on Mac with Ollama for fast seed data generation. Learn CLI setup, prompt formatting, and downstream parsing to generate thousands of samples on consumer hardware.

t-distributed Stochastic Neighbor Embedding says "what"

Understand t-SNE dimensionality reduction for visualizing high-dimensional data. Covers perplexity parameter tuning, implementation with TF-IDF vectors, and interactive visualization best practices.

Mining word collocations

Extract common bigrams and trigrams from text using Gensim and NPMI scoring. Learn to mine jargon, phrases, and collocations from customer reviews, feedback, and text corpora.

Kernel Density Estimation

Create effective KDE plots with Seaborn. Learn optimal bin settings, histogram layering, and lesser-known parameters for better distribution visualization.

Favorite Jupyter Notebook Settings

Essential Jupyter Notebook customizations to improve your data science workflow. Configuration tips for enhanced productivity and better user experience.

Subscribe

All the latest posts directly in your inbox.