Skip to content
View nguyenpavel's full-sized avatar

Highlights

  • Pro

Block or report nguyenpavel

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
nguyenpavel/README.md

👋 Hi, I’m Pavel

AI Engineer • Data & Product Builder

profile-views

🔗 Connect with me


🚀 What I do

  • Ship AI products end-to-end: data pipelines → RAG/LLMs → UX → deployment
  • Build predictive models for forecasting, churn/LTV, risk scoring, anomaly detection
  • Blend Python, TypeScript, dbt, BigQuery, GCP/AWS, Docker to deliver reliable, observable systems
  • Love LLM evals, prompt tooling, vector search, analytics, and automation

Right now

  • 🔭 Building AI compliance tooling and data apps
  • 💬 Ask me about React/TS, Node, Python, data pipelines, RAG, OpenAI, LangChain

🧰 Tech Stack

🗣️ Languages

📚 Frameworks & Libraries

🧱 Data & Storage

🛠️ Tools

☁️ Cloud & Environments

📊 BI & Analytics / Software


🌟 Featured Projects

Project highlights (click to expand)

🧠 LLM-Assisted Grocery Sales Forecasting

  • Benchmarks SARIMA, Neural Prophet, LSTM, RF, MLP with GPT-4 vs Claude-3 in autonomous vs assistive modes
  • Representative result: Neural Prophet RMSLE ≈ 0.1458 (GPT-4, role: Data Scientist); explores prompt-role/sentiment effects
  • Repro tips: deterministic seeds, leakage guards, schema validation

🎲 BoardGame RAG (AWS-native)

  • Serverless RAG over 150k+ games + rulebook PDFs
  • Kendra retrieval + Bedrock (Claude), optional OpenSearch vectors, Cognito auth, Amplify UI
  • Glue/Step Functions ingest; CI/CD via CodeBuild/CodePipeline; cost/monitoring guardrails

🧹 Multi-Platform Spam Classification

  • Spam detectors across email/SMS/YouTube using TF-IDF+NB, FastText+MLP, DistilBERT
  • In-domain: DistilBERT up to 0.992 acc; under domain shift all models drop—analysis of precision/recall trade-offs
  • Future: domain adaptation, hard-negative mining, lightweight adapters

💳 Loan Default Prediction

  • LendingClub 1.3M rows; application-time only (no leakage)
  • ROC AUC ≈ 0.73 with Random Forest/XGBoost; engineered features (FICO composite, credit history length, installment/income)
  • Next: calibrated probabilities, cost-aware thresholding, LightGBM/CatBoost

🤝 Let’s collaborate

I enjoy shipping useful tools for data-heavy teams. If you’re exploring AI copilots, RAG systems, analytics platforms, or AI automation, ping me — happy to chat.

“Ship small, measure, iterate.”

Popular repositories Loading

  1. london-house-prediction london-house-prediction Public

  2. dojo-bin-generator dojo-bin-generator Public

    Forked from node-dojo/dojo-recursive-bins

    3d printable bin generator using blender geometry nodes

  3. rag-with-llm rag-with-llm Public

    This is a project building a local RAG system with LLM

    Python

  4. linkedIn_auto_jobs_applier_with_AI linkedIn_auto_jobs_applier_with_AI Public

    Forked from feder-cr/Jobs_Applier_AI_Agent_AIHawk

    LinkedIn_AIHawk is a tool that automates the jobs application process on LinkedIn. Utilizing artificial intelligence, it enables users to apply for multiple job offers in an automated and personali…

    Python

  5. SunoApi_Fork SunoApi_Fork Public

    Forked from AICodeHunt/SunoApi

    Python

  6. llama-ocr llama-ocr Public

    Forked from Nutlope/llama-ocr

    Document to Markdown OCR library with Llama 3.2 vision

    TypeScript