Description

About Us: We are a forward-thinking AI-driven company dedicated to pushing the boundaries of large-scale language models and their applications. We specialize in building cutting-edge AI solutions that leverage the latest advances in machine learning, NLP, and deep learning technologies. Join our dynamic and innovative team to help shape the future of AI.

Job Summary: We are seeking an experienced LLM Engineer with at least two years of hands-on experience training large language models (LLMs) from scratch and fine-tuning them. The ideal candidate will have a Master’s degree in Machine Learning or a related field and be proficient in designing, building, and optimizing state-of-the-art LLMs. If you are passionate about AI, NLP, and large-scale model architectures, and have a proven track record of training models at scale, we want to hear from you!

Responsibilities

Design, develop, and train large language models (LLMs) from scratch using cutting-edge machine learning techniques.
Optimize model architecture and hyperparameters for performance, scalability, and efficiency.
Collaborate with cross-functional teams including data scientists, engineers, and researchers to develop robust AI solutions.
Evaluate model performance, apply advanced fine-tuning techniques, and ensure models generalize well across tasks.
Research and implement the latest advancements in NLP, machine learning, and large-scale models.
Utilize deep learning frameworks such as PyTorch or TensorFlow to experiment with new model architectures.
Contribute to the deployment and scaling of trained models in production environments.
Stay up to date with the latest trends and developments in large-scale language models and machine learning.

Skills

Master’s degree in Machine Learning, Artificial Intelligence, Computer Science, or a related field.
Minimum of 2 years of experience training large language models (LLMs) from scratch and fine-tuning them.
Proven experience with deep learning frameworks such as PyTorch, TensorFlow, or JAX.
Strong understanding of NLP techniques, transformers, and the inner workings of large language models.
Solid programming skills in Python and experience with ML libraries.
Experience in optimizing models for performance and scalability.
Familiarity with distributed training techniques and cloud platforms (e.g., AWS, GCP, Azure).
Excellent problem-solving skills and attention to detail.
Strong communication skills and the ability to work in a collaborative environment.

Preferred Skills:

Experience with reinforcement learning, unsupervised learning, or generative models.
Prior experience in deploying LLMs in production environments.
Knowledge of prompt engineering and fine-tuning for specific applications.
Contribution to open-source machine learning projects.

Benefits

Work on groundbreaking projects in the field of AI and NLP.
Collaborate with a talented and passionate team of researchers and engineers.
Competitive salary and benefits.
Opportunity for continuous learning and professional development.
Flexible work environment and potential for remote work.

How to Apply: Please send your resume and cover letter to [contact email]. We look forward to exploring how your expertise can contribute to our mission!