RedditGPT
Fine-tuning GPT-2 model to generate and summarize content based on Reddit data.
Tennis Gym
Reinforcement learning for the tennis gym - Multi-DDPG agent
Reacher Gym
Reinforcement learning for the reacher gym - DDPG
AI Real Estate Agent
Developing AI-powered tools to optimize real estate search and engagement using RAG, instruction fine-tuning with OpenAI API.
Instruction-Tuning-LLM
Fine-tuning large language models for improved instruction following and domain-specific tasks.