目录
Sanjid Hasan Titles

Profile Views GitHub Followers GitHub Stars


🧑‍💻 Who Am I?

class SanjidHasan:
    def __init__(self):
        self.username    = "Sanjidh090"
        self.education   = {
            "degree":      "B.Sc. in Electrical & Electronic Engineering (3rd Year)",
            "institution": "Khulna University of Engineering & Technology (KUET)",
            "country":     "Bangladesh"
        }
        self.expertise   = [
            "Deep Learning & Neural Networks",
            "Computer Vision & NLP",
            "Automatic Speech Recognition (ASR)",
            "Data Science & Analytics",
            "Embedded Systems & PCB Design"
        ]
        self.current_focus = [
            "LLM Fine-tuning", "Bengali Low-Resource NLP",
            "Generative AI", "Edge AI Systems"
        ]
        self.open_to     = [
            "Research Collaborations", "Internships",
            "Freelance / Contract Work", "Full-time (Post-Graduation)"
        ]
        self.philosophy  = "True mastery is demonstrated by the ability to teach."

    def say_hi(self):
        print("Thanks for visiting! Let's build something amazing together 🚀")

me = SanjidHasan()
me.say_hi()

🏆 Achievements at a Glance

Achievement Details
📄 arXiv Published Researcher Make It Hard to Hear, Easy to Learn — Long-Form Bengali ASR & Diarization (Feb 2026)
🏅 Kaggle Expert Notebook Expert + Dataset Expert
🗺️ Google Maps Level 7 Local Guide 1,000+ contributions · Digital heritage preservation
🔬 Former Research Assistant Computer Vision @ Hackules Inc.
Team Frozen Voltage — Founder Active multidisciplinary tech team @ KUET

📄 Research & Publications

🔬 arXiv Preprint — February 2026

Make It Hard to Hear, Easy to Learn: Long-Form Bengali ASR and Speaker Diarization via Extreme Augmentation and Perfect Alignment Sanjid Hasan, Risalat Labib, A H M Fuad, Bayazid Hasan

Submitted to DL Sprint 4.0 @ BUET CSE Fest 2026. Introduces Lipi-Ghor-882 — a comprehensive 882-hour multi-speaker Bengali speech dataset — and demonstrates that targeted fine-tuning with synthetic acoustic degradation outperforms raw data scaling for low-resource ASR.

Key Contributions:

  • 🗃️ Lipi-Ghor-882 Dataset — 882 hours of multi-speaker Bengali audio (HuggingFace: Lipi-Ghor-bn-882-SSTT)
  • 🎙️ Whisper-based ASR pipeline with extreme augmentation strategy
  • 👥 Speaker diarization revealing failure modes of SOTA open-source models on Bengali data

📊 Public Datasets

Dataset Platform Domain
Lipi-Ghor-bn-882-SSTT HuggingFace Bengali ASR / Speaker Diarization
Bangladesh News Dataset Kaggle · ResearchGate NLP / Text Classification
KUET Whispers Dataset Facebook Analytics Social Media Speech
BUETIAN RAPSODY Dataset Social Media Mining Multimodal
Google Maps Dataset — Khulna Geospatial Location Intelligence

🎖️ Competition Record

Competition Team Result Category Year
BUET CSE Fest — DL Sprint 4.0 Team Villagers 🥉 2nd Runner-Up + Dataset Award National Datathon 2026
AgriYield 2025 Solo 🥉 3rd Place Crop Yield Prediction 2025
Legends of Logic Team Frozen Voltage 3rd & 5th Math History Chronicles 2025
First Byte Datathon Team Frozen Voltage 8th / Top 10 Intra-KUET Datathon 2025
NWU CSE Fest Datathon Team Maverics 4th Place National Competition 2025
CUET CSE Fest 2025 Team Envisage Top 60 Political Meme Detection (80% F1) 2025
CUET ETE Televerse 2025 Team Maverics Top 40 Shobdotori NLP Challenge 2025

💡 Note: AgriYield 3rd place was a solo submission — no team, no shortcuts.


💻 Technical Stack

🧠 AI / ML & Data Science

Python PyTorch TensorFlow scikit-learn Keras Pandas NumPy OpenCV Hugging Face

💾 Languages

C C++ Python JavaScript HTML5

🛠️ Tools & Platforms

Git GitHub Jupyter VS Code Kaggle Google Colab Notion

🎯 Specialized Domains

  • ASR & Speech: Whisper fine-tuning, long-form Bengali speech, speaker diarization, VAD pipelines
  • Computer Vision: CNNs, Object Detection, Neural Style Transfer
  • NLP: Transformers, BERT, mBERT, low-resource Bengali NLP
  • Generative AI: LLM fine-tuning, Prompt Engineering, RAG
  • Embedded Systems: PCB design & fabrication, hybrid C/Python architecture

📚 Academic Repositories

Repo Course Description
CSE-2132 Algorithms & Data Structures Week-by-week DSA in C++ — Linked Lists, Stacks, Sorting, Hashing
EE-1222 Embedded Systems Product Management & Billing System · Custom PCB design

✍️ Technical Writing

DEV.to Medium

Article Platform Topic
Painting with Pixels: The Mathematics of Style Transfer Medium Gram Matrices, loss functions, artistic AI
What is Invariance? DEV.to Translation, Rotation & Scale Invariance in CNNs
Gigabytes vs. Gibibytes DEV.to Decimal vs. binary storage standards
The Enigma of </> DEV.to Coding culture & semiotics

“I don’t simply consume technology — I deconstruct it, optimize it, and teach it.”


⚡ Team Frozen Voltage

“Frozen but not forgotten. New goals. New energy. Same fire.” ❄️🔥

An active, multidisciplinary student team at KUET pushing boundaries across AI, robotics, embedded systems, and competitive data science.

Core Members: Sanjid Hasan (Founder & AI Lead) · Golam Rabby (Data Science & Math) · Shahriar Kamal (Co-representative)

What we build:

  • 🤖 Robotics — Line-following robots with custom PCB design
  • 🧬 Biomedical — Adaptive BAMs (Bioresorbable Acoustic Microrobots) @ SciBlitz 1.0
  • 📊 Datathons — Consistent national top-10 finishes
  • 📖 Technical writing — Award-winning math & literature content

🗺️ Google Maps — Level 7 Local Guide

100+ Followers · 1,000+ Contributions · Level 7

“I view my Maps profile as a digital archive — preserving the vibes and cultures of places that have been removed or closed. These photos become relics on maps, memories that would otherwise be lost to time.”

From quiet corners of KUET to the streets of Narsingdi and Khulna — one contribution at a time.


🎓 Certifications

  • AI Agents Intensive Course — Google (2025)
  • AIML Workshop — INDcon 2025 @ MIST
  • Python — Kaggle Learn
  • Intro to Machine Learning — Kaggle Learn

📊 GitHub Analytics

Productive Time Contribution Graph Snake animation

🎯 2026 Focus

const focus2026 = {
    research:     ["Bengali Low-Resource NLP", "Long-Form ASR", "Speaker Diarization"],
    learning:     ["Reinforcement Learning", "Vision Transformers", "Edge AI"],
    building:     ["Production-grade ASR pipelines", "Embedded AI systems", "Open datasets"],
    contributing: ["Open Source", "Technical Writing", "Community Education in Bangladesh"],
    goals:        ["Publish more research", "Top-10 Kaggle competitions", "Tech conference talks"]
};

🌐 Connect

GitHub Kaggle LinkedIn Gmail DEV.to Medium YouTube HuggingFace arXiv Google Maps


Open to research collaborations · internships · freelance work · full-time opportunities

Made with ❤️ and ☕ by Sanjid Hasan · Last Updated: March 2026

邀请码
    Gitlink(确实开源)
  • 加入我们
  • 官网邮箱:gitlink@ccf.org.cn
  • QQ群
  • QQ群
  • 公众号
  • 公众号

版权所有:中国计算机学会技术支持:开源发展技术委员会
京ICP备13000930号-9 京公网安备 11010802032778号