Skill Bolt
Initializing Platform
Skill Bolt
Marketplace Services Custom Projects Customization About Blog Contact Affiliate Program
Login Get Started Free

Connect with us

AI & Machine Learning v1.0.0 Intermediate

Semantic Project Search & Recommendation Engine

0.0 (0)
0 Downloads
Updated 1 hour ago

A semantic search and recommendation engine that finds coding projects by meaning, not keywords.

Technologies & Skills

Python FastAPI Uvicorn sentence-transformers PyTorch NumPy scikit-learn FAISS Anthropic Claude API Pydantic HTML CSS JavaScript pytest

Tags

ai machine-learning semantic-search nlp embeddings fastapi python recommender-system vector-search full-stack
INR 375
INR 500 25% OFF

Limited time offer

Complete, documented full-stack AI/ML project with clean, modular source code: a semantic search & recommendation engine (FastAPI + sentence-transformers) featuring hybrid search, explainable results, an interactive semantic map, an insights dashboard, and AI-generated briefs. Includes setup/usage guides, a full API reference with Swagger docs, a project report, and a passing pytest suite — and it runs out of the box with an automatic offline fallback.

What's Included

Complete Source Code
Documentation
Project Report
Presentation Slides
External Download Link

Support & Customization

Support: Standard
Custom modifications not available
File Size 329.20 MB
Last Updated Jul 02, 2026

Resource Links

Purchase this project to unlock source and premium resources. Document/report remain secure preview-based on this page.

Overview

SemanticFinder is a Semantic Project Recommendation & Search Engine. Instead of matching keywords, it converts a catalogue of coding projects and the user's natural-language query into high-dimensional vector embeddings (via sentence-transformers) and ranks results by cosine similarity — so a search like "detect objects in images with deep learning" surfaces the right projects even when they share no words with the query.


How it works

On startup, every project's title, description and tags are embedded into a normalised vector and stacked into an in-memory matrix. A query is embedded the same way, and relevance is the cosine similarity between vectors. The top matches are returned with scores and short "why it matched" explanations. A hybrid mode blends this semantic score with a TF-IDF keyword score for the best of both. "More like this" reuses a project's own vector as the query for item-to-item recommendations.


Key features

- Semantic, keyword, and hybrid search with difficulty/language filters

- Match explanations and per-result relevance scores

- Item-to-item recommendations

- Interactive semantic map (PCA projection + k-means theme clusters)

- Insights dashboard (language, difficulty, and topic breakdowns)

- AI-generated project briefs and auto-tags (Claude) with heuristic fallbacks

- Automatic fallback embedding backend so the app always runs

- Light/dark themes, keyboard shortcuts, and a shortlist — no framework or build step


Tech stack

Python, FastAPI, sentence-transformers, PyTorch, NumPy, scikit-learn, FAISS (optional), Anthropic Claude (optional), and a vanilla HTML/CSS/JS frontend. Covered by an automated pytest suite.


Who it's for

Learners and developers exploring smart search, recommendations, and applied NLP/embeddings — and anyone who wants a clean, well-documented reference implementation of a semantic search engine.

Future Enhancements

- Precomputed/persisted embedding cache for instant startup

- Personalised recommendations from search history and saved items

- Larger, live dataset ingested from a real projects API

- Hosted public demo with a shareable URL

- Cross-encoder re-ranking for even sharper relevance

- Server-side user accounts and saved collections

Known Issues

- The optional sentence-transformers/PyTorch stack is large; on very new Python versions or low-memory machines it may fail to install — the app then falls back to a lightweight built-in embedder (lexical rather than fully semantic matching).

- First run downloads the embedding model (~80 MB), so initial startup is slower; later runs are fast (cached).

- LLM auto-tagging/briefs need an ANTHROPIC_API_KEY; without one, a deterministic heuristic is used.

Installation

1. Ensure Python 3.9+ is installed (check: python --version).

2. In the project folder, create a virtual environment:

  python -m venv .venv

3. Activate it:

  - Windows (PowerShell): .venv\Scripts\Activate.ps1

  - macOS/Linux: source .venv/bin/activate

4. Install core dependencies (lightweight, works everywhere):

  pip install -r requirements.txt

5. (Recommended) Install the full semantic engine + extras:

  pip install -r requirements-optional.txt

6. Run the app:

  python run.py

7. Open http://127.0.0.1:8000 in your browser (API docs at /docs).


Note: If sentence-transformers can't install on your Python version, the app auto-falls back to a built-in embedder and still runs. No API keys required; optional LLM features activate only if you set ANTHROPIC_API_KEY in a .env file.

Usage

1. Search: type a natural-language query (e.g. "beginner-friendly Python data visualization") and press Search. Results are ranked by meaning, each with a match % and the topics it matched on.

2. Modes: switch between Hybrid (semantic + keyword), Semantic (embeddings only), or Keyword (TF-IDF). Filter by difficulty and language.

3. Recommendations: "Similar" finds related projects; "Starter brief" generates an AI build plan (stack, steps, stretch ideas); "Auto-tag" produces an AI summary and tags.

4. Map: the Map tab places every project by meaning — closer = more similar, colored by auto-discovered theme. Click a point for related projects.

5. Insights: the Insights tab shows a dashboard of the dataset (counts by language, difficulty, and top topics).

6. Extras: save projects with the star (shortlist), toggle light/dark (top-right), press "/" to focus search.


The same features are available as JSON endpoints — see /docs. Example:

curl -X POST http://127.0.0.1:8000/api/search -H "Content-Type: application/json" -d '{"query":"detect objects in images","top_k":3}'

System Requirements

- OS: Windows, macOS, or Linux

- Runtime: Python 3.9+ (tested on Python 3.14)

- RAM: 2 GB minimum; 4 GB+ recommended with sentence-transformers

- Disk: ~150 MB core; ~2–3 GB with the optional PyTorch/sentence-transformers stack

- Network: needed once to download the embedding model (~80 MB) on first run; runs offline afterward

- Browser: any modern browser (Chrome, Edge, Firefox, Brave)

- Optional: an Anthropic API key for live LLM auto-tagging/briefs (a built-in fallback works without one)

No Q&A available yet

Be the first to ask a question!

Ask a Question

Customer Reviews

0.0 0 reviews
5
0
4
0
3
0
2
0
1
0

Write Your Review

No reviews yet

Be the first to review this project!

Related

Similar Projects

You might also be interested in these projects

sentiment analysis with sarcasm
AI & Machine Learning
0.0 (0)
Advanced
S
Shruti Soni
Verified Seller
90% OFF

sentiment analysis with sarcasm

sarcasm detection ia a advanced NLP task that typically requires building a custom machine learning and deep learning model

tkinter ui Deep learning
₹1,000 ₹10,000
View Project
AI Landmark Lens - Smart Image Recognition System
AI & Machine Learning
0.0 (0)
Advanced
M
Md Asra
Verified Seller

AI Landmark Lens - Smart Image Recognition System

AI travel assistant that identifies landmarks from images and provides history, hotels, food, attractions, and travel recommendations.

Python FastAPI Node.js +10
Software Behaviour Predictor
AI & Machine Learning
0.0 (0)
Intermediate
P
PALLA SIVA
Verified Seller
20% OFF

Software Behaviour Predictor

Software issue tracking systems collect large volumes of text-based bug reports and user complaints

HTML CSS Python +4
₹3,999 ₹4,999
View Project
AI PDF Assistant – RAG Chatbot using LlamaIndex & Qdrant
AI & Machine Learning
FREE
0.0 (0)
Advanced
S
Shanmukh Singampalli
Verified Seller

AI PDF Assistant – RAG Chatbot using LlamaIndex & Qdrant

An AI-powered RAG chatbot that answers questions from uploaded PDF documents using vector search and LLMs.

Python Streamlit LlamaIndex +6