-
3 SpaCy Tricks for Efficient Text Processing & Entity Recognition
In this article, we will explore three essential spaCy tricks that every developer should have in their toolkit to maximize processing speed and customize entity recognition.
-
What the Agentic Era Means for Data Science
Learn how AI agents are reshaping data science workflows and which skills practitioners need in 2026.
-
7 Steps to Mastering Time Series Analysis with Python
This article breaks down 7 key steps to help you analyze and forecast time series data with Python.
-
How to Write to Files in Python: A Beginner’s Guide
Learn how to write, append, and save text, CSV, and JSON files in Python using native file handling tools that work out of the box.
-
5 Fun Papers That Explain LLMs Clearly
Want to understand LLMs better? Start with these five foundational papers that explain how they work.
-
A Gentle Primer on LLM Explainability
This article discusses LLM explainability and outlines the advances, trends, and ongoing developments in this important field of study.
-
10 GitHub Repositories for Modern Database Systems and Tools
Explore 10 top open-source GitHub repositories for modern databases, analytics, SQL, caching, monitoring, replication, PostgreSQL, SQLite, and AI agent memory.
-
Mocking a Year of IoT Sensor Time Series Data with Mimesis
In this guide, you will learn how to generate a year's worth of daily temperature readings that follow a realistic seasonal curve, together with device-level metadata, all built with open-source frameworks.
-
5 Must-Know Python Concepts for Data Scientists
In this article, we will dive deep into five must-know Python concepts that will help you transition from writing clunky, slow spaghetti code to constructing lightning-fast, production-grade, and beautifully functional data pipelines.
-
Practical NLP in the Browser with Transformers.js
This tutorial covers three NLP tasks: text classification, zero-shot labelling, and question answering using Transformers.js's pipeline() API.
-
The ‘Entry-Level’ Gatekeeper: Auditing Job Descriptions with Textstat
This article shows how to use free, open-source tools like Python and its Textstat library to build a script that automates the process of capturing "gatekeeping language" in job descriptions before publishing them.
-
Tweaking Local Language Model Settings with Ollama
In this article, we will go deep under the hood of Ollama's configuration engine, exploring how to fine-tune local language model parameters.
-
7 Real World AI Projects to Build in 2026 (with Guides)
Explore seven practical AI projects that automate real workflows, including job search, web research, investment research, market trend analysis, invoice processing, chart digitization, and personalized exercise training.
-
Pandas GroupBy Explained With Examples
Learn how to use Pandas GroupBy to summarize, compare, and analyze grouped data with simple, practical examples.
-
5 Scipy.stats Tricks for Simulating ‘What If’ Scenarios
In this article, we will take a look under the hood of scipy.stats, exploring five essential tricks to design high-performance, rigorous simulations using only NumPy and SciPy.
-
Visual Debugging Tools for Machine Learning Workflows
In this article, we cover three topics: what to visualize during training, the tools that provide those visualizations, and the methods to capture model computations directly using hooks and breakpoints.
-
Top 7 Python Libraries for Large-Scale Data Processing
This article covers Python libraries that make large-scale data processing faster, more scalable, and easier to manage across modern data workflows.
By Bala Priya C, KDnuggets Contributing Editor & Technical Content Specialist on May 26, 2026 in Python
-
Auditing Model Bias with Balanced Datasets with Mimesis
Learn how to use the Mimesis library to generate a balanced, counterfactual dataset that helps analyze potential bias in your models.
-
5 More Must-Know Python Concepts
Let's take a look at five more fundamental concepts that every Python developer should have in their toolkit.
-
Easy Agentic Tool Calling with Gemma 4
In this tutorial, we will give Gemma 4 two new tools and watch the model decide, on its own, when to look around and when to compute.
-
System Design Interview Questions: A Handy Collection
Ace system design interviews with 10 GitHub repositories packed with fundamentals, proven patterns, and real questions to help you design scalable systems with confidence.
-
Best Small Language Models on Hugging Face Right Now!
Take a curated look at the best small language models currently available on Hugging Face, what each one is actually good at, the benchmark numbers that back those claims up, and the code to get started with each one.
-
Anonymizing Production Data for Data Science with Mimesis
Learn how to utilize Python's Mimesis library for anonymizing sensitive production data, based on a step-by-step example to try yourself.
-
SQL Window Functions Beyond Basics: Solving Real Business Problems
You know window functions, but do you know how to use them to solve business problems? You will after you read this article.
By Nate Rosidi, KDnuggets Market Trends & SQL Content Specialist on May 20, 2026 in SQL
-
10 GitHub Repositories to Master Quant Trading
From your first backtest to a real trading system, here are GitHub repos that can seriously level up your quant trading skills fast.
-
How to Get the Most Out of Claude Cowork
Cowork is an autonomous agent that lives inside the Claude Desktop app, has direct access to a folder on your computer, and can plan, execute, and deliver real work.
-
Top 10 Python Libraries for Data Engineering in 2026
Want to level up your data engineering toolkit? Here are some Python libraries that'll make your pipelines faster, cleaner, and easier to maintain.
-
The Hidden Skill Gap: Why Knowing SQL + Python Isn’t Enough Anymore
This article is about the gap between what candidates prepare for and what companies actually need right now.
-
5 Cool Things I Did with Local Language Models
I have been running local models as part of my daily workflow for some time, and what surprised me most is how often local turned out to be the better choice, not a compromise.
-
TurboQuant: Is the Compression and Performance Worth the Hype?
How does it boost efficiency without losing accuracy? Is it really worth the hype?
-
5 Must-Know Python Concepts
In this article, we will explore five fundamental concepts that every Python developer should have in their toolkit.
-
Time-Series Feature Engineering with Python Itertools
Learn how to use Python itertools to build efficient and scalable time series features.
-
5 Small Language Models for Agentic Tool Calling
Here are 5 small language models that share one important trait: they all support structured tool calling in a compact, open-weight package.
-
How AI Agents Will Transform Data Science Work in 2026
Discover how AI agents will revolutionize data science in 2026, and learn why they won't replace you but will make you a faster, smarter analyst.
-
10 GitHub Repositories to Master Self-Hosting
Learn how modern infrastructure works through self-hosting: containers, reverse proxies, monitoring, backups, networking, and systems operations.
-
Using Polars Instead of Pandas: Performance Deep Dive
In this article, we explore three real data problems using real questions where Polars outpaces Pandas on every metric.
By Nate Rosidi, KDnuggets Market Trends & SQL Content Specialist on May 12, 2026 in Python
-
5 Useful Python Scripts for Time Series Analysis
Time series data is common across finance, operations, engineering, and research. These five Python scripts cover the analysis tasks that come up repeatedly.
By Bala Priya C, KDnuggets Contributing Editor & Technical Content Specialist on May 12, 2026 in Python
-
Guardrails for LLMs: Measuring AI ‘Hallucination’ and Verbosity
This article discusses how to implement an infrastructure for measuring and controlling overly verbose LLM responses.
-
Build an AI-Powered Learning Management System That Actually Trains People
Learn how to build an AI-powered Learning Management System from scratch using Ollama, FastAPI, and React. A step-by-step guide for beginner and intermediate developers.
-
10 GitHub Repositories to Master FastAPI
Learn FastAPI through templates, examples, guides, auth tools, microservices, full-stack starters, and machine learning projects.
-
Stop Wasting Tokens: A Smarter Alternative to JSON for LLM Pipelines
If you are feeding structured data into an LLM, there is a good chance you are paying a JSON tax.
-
How to Build Vector Search From Scratch in Python
Learn how to build a vector search engine from scratch in Python with embeddings, similarity scoring, and basic retrieval logic.
By Bala Priya C, KDnuggets Contributing Editor & Technical Content Specialist on May 8, 2026 in Python
-
Building Modern EDA Pipelines with Pingouin
Learn how to build a holistic pipeline for rigorous, statistical EDA, validating several important data properties.
-
7 Everyday Distributions Explained Simply
This article is a quick, everyday tour of seven distributions you'll actually recognize once you know what to look for. No heavy math. No gatekeeping.
-
Abacus AI Review: Features, AI Agents & Automation Explained (Honest Guide)
A detailed Abacus AI review covering ChatLLM, Abacus AI Agent, Claw, automation, app building, image and video generation, pricing, pros, cons, and who should use it.
-
How to Set Up Claude Code Channels Locally
Learn how to connect Claude Code to Discord locally, pair your account, control access, and keep the bot running reliably.
-
7 OpenCode Plugins That Make AI Coding More Powerful
Learn which seven OpenCode plugins add memory, search, Gemini, terminal control, analytics, and reusable skills to make AI coding workflows stronger.
-
Baptists and Bootleggers: The Hidden Coalition Behind ‘Data-Driven’ Decisions
One is genuine curiosity. The other is someone who already knows what they want and went looking for a number to back it up.
-
5 Fun Projects Using Claude Code
Turn Claude Code into your AI coding partner with these 5 hands-on projects, from beginner-friendly builds to advanced agent workflows.
-
Testing SQL Like a Software Engineer: Unit Testing, CI/CD, and Data Quality Automation
How to turn an interview-style SQL query into a production-ready, testable, version-controlled workflow.
By Nate Rosidi, KDnuggets Market Trends & SQL Content Specialist on May 4, 2026 in SQL
-
How to Deploy Your First App on FastAPI Cloud
Learn how to build, test, deploy, and monitor your first FastAPI Cloud app, a simple live gold and silver dashboard.
-
7 Practical Ways to Reduce Claude Code Token Usage
Claude Code token costs usually come from bloated context, not just long prompts. These 7 practical tactics help reduce waste without hurting quality.
-
The “Robust” Data Scientist: Winning with Messy Data and Pingouin
This article uncovers the craftsmanship of using robust statistics in data science processes: illustrating what to do when data fail tests due to not meeting standard assumptions.
-
Open Weight Text-to-Speech with Voxtral TTS
Learn how the Voxtral TTS model works, what makes its voice cloning and low‑latency performance special, and how to start generating speech with just a few lines of Python code.
-
Learn The Most In-Demand Tech Skills for FREE
From April 30 - May 10, Zero To Mastery's entire course catalogue is 100% free.
-
Building Agentic AI Systems with Microsoft’s Agent Framework
Read this technical walkthrough of safety, MCP, workflow orchestration, and agentic RAG in Python.
-
5 Powerful Python Decorators to Build Clean AI Code
This article outlines five particularly useful Python decorators that, based on developers' experience, have proven effective at making AI code cleaner.
-
Self-Hosted LLMs in the Real World: Limits, Workarounds, and Hard Lessons
This article is about what actually happens when you take self-hosted LLMs seriously: not the benchmarks, not the hype, but the real operational friction most tutorials skip entirely.
-
Local Whisper Audio Transcription
Learn how to transcribe audio locally using Faster‑Whisper and Python, with an emphasis on a privacy‑first, CPU/GPU‑ready setup.
-
A/B Testing Pitfalls: What Works and What Doesn’t with Real Data
Learn why most “winning” experiments fail in production, and how top companies avoid this.
-
10 Python Libraries for Building LLM Applications
Learn the top Python frameworks for LLM apps covering fine-tuning, model loading, serving, RAG pipelines, multi-agent systems, and evaluation.
-
7 Practical OpenClaw Use Cases You Should Know
Learn how people are using OpenClaw to automate workflows, build custom agents, boost productivity, and turn AI into practical action.
-
AI Engineering Hub Breakdown: 10 Agentic Projects You Can Fork Today
These 10 projects will teach you agent engineering faster than any tutorial.
-
7 Specific Unconventional Things to Do with Language Models
These are seven unconventional uses of LLMs that go far beyond the usual chat interface and conversations.
-
5 GitHub Repositories to Learn Quantum Machine Learning
If you want to learn quantum machine learning in 2025, these five GitHub repositories can get you started in hours, not months.
-
10 GitHub Repositories To Master Claude Code
Learn faster, build smarter, and unlock the full power of Claude Code through real examples, reusable templates, prompts, workflows, subagents, and system design.
-
Seeing What’s Possible with OpenCode + Ollama + Qwen3-Coder
Run a powerful, private AI coder locally with OpenCode, Ollama & Qwen3-Coder. Free, offline, and unlimited.
-
Advanced Pandas Patterns Most Data Scientists Don’t Use
Learn method chaining, pipe(), efficient joins, optimized groupby operations, and vectorized logic to write faster and cleaner pandas code.
By Nate Rosidi, KDnuggets Market Trends & SQL Content Specialist on April 21, 2026 in Python
-
5 Docker Best Practices for Faster Builds and Smaller Images
By applying a few smart Docker practices, you can build images faster and keep them clean, compact, and production-ready.
-
How to Crawl an Entire Documentation Site with Olostep
Automatically collect documentation pages, clean and structure the content, and turn website data into AI-ready output using a few lines of code.
-
Merging Language Models with Unsloth Studio
Merge LLMs easily with Unsloth Studio's no-code GUI and combine models without retraining.
-
5 Free Ways to Host a Python Application
Explore five beginner-friendly platforms that let you host Python apps for free, compare their limits, and pick the right one.
-
I Vibe Coded a Tool That Analyzes Customer Sentiment and Topics From Call Recordings
Build an AI customer sentiment analyzer for call recordings using Whisper, BERTopic & Streamlit with this open-source step-by-step guide with code.
-
5 Useful Python Scripts for Advanced Data Validation & Quality Checks
From missing values to schema mismatches, data issues appear in many forms. These five Python scripts provide smart, automated validation for modern data workflows.
By Bala Priya C, KDnuggets Contributing Editor & Technical Content Specialist on April 17, 2026 in Python