Profile photo

Georgios Peikos

Search and NLP Researcher (RTDA), University of Milano Bicocca · ex Marie Curie ESR (DoSSIER)
Focus · LLMs, Retrieval Augmented Generation, Semantic Search, Ranking and Evaluation.

About

I am an Applied AI Engineer and Search/NLP Researcher at the University of Milano-Bicocca, with a background in Electrical and Computer Engineering. My work focuses on Large Language Models, Retrieval Augmented Generation, and Information Retrieval, with applications in healthcare and other high reliability domains.

I design and deploy end-to-end AI systems that combine data pipelines, semantic retrieval, reranking, model training, inference, and evaluation. My recent work includes evidence-grounded RAG systems, conversational search, clinical trial retrieval, and trustworthy AI workflows.

I have experience training and adapting small and mid-scale language models, including 1B-7B parameter models, using PyTorch DDP on Slurm-managed HPC infrastructure. System design is guided by real-world constraints such as evidence grounding, privacy, efficiency, and expert oversight.

I also teach Information Retrieval and Recommender Systems at Bachelor’s and Master’s level, with an emphasis on evaluation and practical system building.

I am open to collaborations and technical roles involving the design and evaluation of applied AI systems, particularly in healthcare, enterprise search, legal search, and other domains where reliability, grounding, and evaluation matter.

Focus

RAG pipelines: retrieval, reranking, query understanding, and context construction
Large Language Models: training, fine-tuning, context understanding, and generation
(LLM/IR) System Evaluation and system diagnostics: offline metrics, error analysis, and user-oriented evaluation
Trustworthy AI constraints: evidence grounding, privacy, efficiency, and expert oversight

Programmes & Projects

Selected work showing how I design, build, and evaluate retrieval, NLP, and applied AI systems, with links to code, papers, and prototypes where available.

Programme Active Research
Trustworthy health search and privacy preserving clinician in the loop retrieval, developed at UniMiB within the ANTHEM programme.
Impact: Prototyped and evaluated healthcare AI under real constraints: privacy, low compute, expert oversight, and evidence grounded outputs.
Keywords: healthcare AI, conversational AI, RAG, health information retrieval, clinical trials retrieval, privacy, evaluation, human in the loop
Programme Completed Research
Marie Curie PhD research on professional search in health and legal domains, focusing on interaction aware and multidimensional relevance.
Impact: Delivered PhD outputs and applied prototypes for professional search, including clinical trial matching and retrieval methods for medical and legal tasks.
Keywords: information retrieval, professional search, medical IR, legal IR, multidimensional relevance, query representation, evaluation
Tool Maintenance Research Teaching
A visual analytics tool for evaluating and comparing search systems beyond average metrics, developed at UniMiB.
Impact: Delivered a published, open-source visual analytics tool used in IR teaching and applicable as a proof-of-concept for business search evaluation and debugging.
Keywords: search evaluation, information retrieval, ranking, visual analytics, IR experimentation, diagnostics, Streamlit, statistical testing
Case study Active Hobby
A personal vintage retail hobby that gives me a practical setting to build small tools for real workflow automation.
Impact: Developed tools that support my own hobby workflow, including price estimation, inventory handling, promotions, and other recurring operational tasks.
Keywords: applied AI, customer experience, marketing, pricing, unit economics, automation, experimentation, eBay

Selected Publications

Selected publications (most recent and most cited). Full list available on Google Scholar.

Most recent

Most influential

Recent Publication
Fact-Driven Health Information Retrieval: Integrating LLMs and Knowledge Graphs to Combat Misinformation
Gian Carlo Milanese, Georgios Peikos, Gabriella Pasi, Marco Viviani · European Conference on Information Retrieval · 2025
Influential Publication
Utilizing ChatGPT to Enhance Clinical Trial Enrollment
Georgios Peikos, Symeon Symeonidis, Pranav Kasela, G. Pasi · arXiv.org · 2023
Recent Publication
ASPIRE: Assistive System for Performance Evaluation in IR
Georgios Peikos, Wojciech Kusa, Symeon Symeonidis · European Conference on Information Retrieval · 2025
Influential Publication
LeiBi@COLIEE 2022: Aggregating Tuned Lexical Models with a Cluster-driven BERT-based Model for Case Law Retrieval
Arian Askari, Georgios Peikos, G. Pasi, Suzan Verberne · arXiv.org · 2022
Recent Publication
A Decision-Theoretic Framework to Multidimensional Relevance Estimation
Georgios Peikos, Gabriella Pasi · International Journal of Information Technology & Decision Making · 2025
Influential Publication
Leveraging Large Language Models for Medical Information Extraction and Query Generation
Georgios Peikos, Pranav Kasela, Gabriella Pasi · 2024 IEEE/WIC International Conference on Web Intelligence and Intelligent Agent Technology (WI-IAT) · 2024

Teaching

I teach Information Retrieval and Recommender Systems at the University of Milano-Bicocca, with a focus on evaluation, retrieval models, modern search systems, and practical system building.

External Tutorials & Workshops

I also deliver external tutorials and workshops on NLP, Information Retrieval, and applied AI topics for research schools and professional audiences.

Contact

If you would like to discuss collaborations, research, or applied AI projects, you can reach me on LinkedIn: /in/peikosgeorgios

Final Note I would like to thank all the people who have accompanied me during the activities presented above and, over the years, have shaped how I think and work.