Skip to content
View andimarafioti's full-sized avatar
  • Hugging Face
  • Bern, Switzerland

Highlights

  • Pro

Organizations

@huggingface @tifgan

Block or report andimarafioti

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
andimarafioti/README.md

Hi, I’m Andi 👋

I’m an engineer who aspires to be a scientist. I work on multimodal AI, with a strong focus on vision-language models, speech systems, and efficient on-device inference.

I currently work at Hugging Face, where I lead our multimodal research and contribute to projects spanning:

  • Vision-Language Models (VLMs)
  • Speech-to-speech and conversational systems
  • Multimodal research with an emphasis on efficiency and real-world deployment
  • Robotics-facing AI systems

I enjoy building things that are both technically solid and actually usable, from research code to demos and production-ready tools.

What you’ll find here

  • Research prototypes and experimental ideas
  • Open-source tools and demos
  • Work around multimodal models, audio, and vision
  • Occasional side projects

Background

  • PhD in applied machine learning (speech and generative models)
  • Former senior ML engineer at Unity
  • Interested in small, fast, and well-engineered models

Feel free to explore, fork, or reach out.

Pinned Loading

  1. huggingface/speech-to-speech huggingface/speech-to-speech Public

    Speech To Speech: an effort for an open-sourced and modular GPT4-o

    Python 4.4k 502

  2. huggingface/nanoVLM huggingface/nanoVLM Public

    The simplest, fastest repository for training/finetuning small-sized VLMs.

    Python 4.6k 462

  3. florence2-finetuning florence2-finetuning Public

    Quick exploration into fine tuning florence 2

    Jupyter Notebook 339 30

  4. tifgan/stftGAN tifgan/stftGAN Public

    TiFGAN: Time Frequency Generative Adversarial Networks

    Jupyter Notebook 122 13

  5. GACELA GACELA Public

    Generative adversarial context encoder for audio inpainting

    Jupyter Notebook 26 4

  6. tifresi tifresi Public

    STFT transforms suitable for use with PGHI (phase gradient heap integration)

    Python 15 1