
Hey, I'm Kosmas, a Senior AI Engineer at saas.group. I went from writing C firmware for IoT devices to designing LLM systems and RAG pipelines that ship to real users across multiple SaaS products.
About me
I'm a 26-year-old software engineer with a passion for Artificial Intelligence. My journey began at the Aristotle University of Thessaloniki, where I received my M.Eng. degree in Electrical and Computer Engineering. I am currently working as a Senior AI Engineer at saas.group.
I started my career writing embedded firmware in C for IoT devices, which taught me to care about things most ML engineers skip: memory constraints, failure modes, and systems that need to work without someone watching. That mindset stuck with me as I moved into AI.
At saas.group, I build the AI features that go into the products: RAG pipelines, agentic assistants, and NLP systems. I own these end-to-end, from the first prototype to production monitoring. I care a lot about evaluation and observability because shipping an LLM feature without them is just hoping it works.
In my spare moments, I enjoy playing video games, playing football, and learning new skills. Specifically, in the last few months, I started learning video editing, and I can't wait to finish my next travel vlog. Until then, grab some popcorn and check out my most recent video from Iceland on my YouTube channel.
I created this website to share my thoughts and connect with people. Feel free to reach out to me on LinkedIn or via email.
My projects
Txtify
Open-source application that converts audio and video files into text using advanced AI models for fast and accurate transcriptions, available for self-hosting.
- Python
- Whisper
- Hugging Face
- DeepL
- FastAPI
JoyFill
Open-source web application to enhance sentences by providing positive suggestions using BERT for text completion and DistilBERT for sentiment analysis.
- Python
- BERT
- FastAPI
- Docker
- Hugging Face
My skills
- Python
- LLMs
- RAG
- NLP
- LangChain
- LangGraph
- LangSmith
- Langfuse
- AWS
- Docker
- Kubernetes
- FastAPI
- Celery
- Redis
- SQL
- PostgreSQL
- pgvector
- MongoDB
- PyTorch
- TensorFlow
- HuggingFace
- sklearn
- NumPy
- Pandas
- Git
- Bash
- JavaScript
- C
- C++
- LaTeX
My experience
Embedded Systems Engineer Intern - NET2GRID
Thessaloniki, Greece
• Connected AWS Services (IoT Core, DynamoDB, API Gateway, Lambda) with IoT devices, enhancing system capabilities. • Developed and managed a Greengrass Group with Core and Edge devices, optimizing system performance. • Built MQTT bridge connections and implemented a Sequential Keras Model for accurate predictions, improving data communication and analysis.
Jul 2021 ‑ Sep 2021Embedded Systems Engineer - NET2GRID
Thessaloniki, Greece
• Collaborated with a 6-member team and developed an embedded firmware library for IoT device ESP32 using C.
Oct 2021 ‑ Jun 2022Freelancer
Remote, Greece
• Delivered Deep Learning projects using NLP, Computer Vision, and Data Analysis for various clients. • Instructed over 50 university students in AI, Maths, and Programming, achieving 95% positive feedback and significantly enhancing student performance.
Jan 2023 ‑ Sep 2023AI Engineer - Cogninn
Larissa, Greece
• Led the development of AutoDL, a web platform managing diverse data inputs (images, text, videos, tabular data), and architected ML/DL pipelines, resulting in increased model training accuracy and significant business impact. • Implemented complex image processing tasks(classification, object detection, segmentation, OCR) and text analysis functionalities(classification, summarization). • Engineered video object detection and tabular regression capabilities, increasing processing speed by 15%. • Deployed RESTful API endpoints using FastAPI and integrated MongoDB and SQLite for effective data storage.
Sep 2023 ‑ Aug 2024Senior AI Engineer - saas.group
Remote, Anywhere
• Building and maintaining AI features across multiple SaaS brands, owning each from initial design through deployment and production monitoring. • Shipping RAG pipelines, agentic assistants, and NLP features that serve real users daily, with Langfuse-based observability and evaluation suites gating every release. • Reducing LLM latency and costs while keeping outputs reliable, using structured evaluation, tracing, and runtime metrics to catch regressions before users do.
Nov 2024 ‑ Present (Ongoing)Contact me
Please contact me directly at louiskmeta@gmail.com or through this form.






