AI Software Engineer

05/07更新
積極徵才中
19 小時前聯絡過求職者

工作內容

a. Job Description: We are looking for an AI Software Engineer with expertise in performance profiling, model fine-tuning, and architecture identification for large-scale AI models such as Llama, DeepSeek, VLM, Evo-2, LLMs, Robot VLM, and DNA-based models. The ideal candidate has experience with deep learning frameworks, hardware acceleration, and AI optimization techniques to enhance the efficiency and scalability of AI models. b. Performance Profiling & Optimization Analyze and optimize AI model performance across various hardware platforms (GPUs, TPUs, NPUs). Profile training and inference pipelines for memory usage, compute efficiency, and latency. Work with CUDA, TensorRT, PyTorch, and JAX to optimize models for production deployment. Implement quantization, pruning, distillation, and mixed-precision training techniques. Debug performance bottlenecks using NVProf, Nsight, TensorBoard, and PyTorch Profiler. 2. Architecture Identification & Model Analysis Reverse-engineer and analyze LLM/VLM architectures to extract key architectural details. Identify activation functions, attention mechanisms, and parameter distributions in pretrained models. Compare performance trade-offs between transformer-based, mixture-of-experts (MoE), and diffusion models. Work on model compression techniques for real-time AI applications. 3. Fine-Tuning & Customization Fine-tune LLMs, VLMs, and DNA-based AI models on domain-specific datasets. Use LoRA, QLoRA, PEFT, and Adapter methods for parameter-efficient fine-tuning. Implement prompt engineering, retrieval-augmented generation (RAG), and reinforcement learning (RLHF) for LLMs. Train and deploy robotic VLMs for real-world AI applications. c. Qualifications: Bachelor’s/Master’s/PhD in Computer Science, Machine Learning, AI, or related fields. 3+ years of experience in deep learning, model optimization, and AI performance engineering. Strong proficiency in Python, PyTorch, TensorFlow, and JAX. Experience with CUDA, Triton, TensorRT, and ONNX for AI acceleration. Understanding of transformer architectures, vision-language models (VLMs), and multi-modal AI. Familiarity with LLM fine-tuning techniques, large-scale distributed training, and RLHF. Experience working with scientific AI applications (e.g., DNA sequencing, robotics, computational biology). - Non smoking

工作待遇

待遇面議

(經常性薪資達 4 萬元或以上)

工作性質

全職

上班地點

新竹縣竹北市(依照公司規定分派)

管理責任

不需負擔管理責任

出差外派

無需出差外派

上班時段

日班

休假制度

依公司規定

可上班日

不限

需求人數

不限

條件要求

工作經歷

不拘

學歷要求

大學以上

科系要求

不拘

語文條件

不拘

擅長工具

C++C#

其他條件

未填寫

歡迎所有求職者,與
應屆畢業生
外籍人士

公司環境照片(1張)

福利制度

優於勞基法之福利制度,

聯絡方式

聯絡人

HR

應徵回覆

合適者將於7個工作天內主動聯繫,不合適者將不另行通知
104人力銀行提醒您履歷關閉時仍可投遞履歷喔!面試時請遵守求職禮儀準時赴約並小心安全
求職安全專線【勞動部】0800-085-151【104人力銀行】02-29126104轉2 或來信詢問
建議使用104內建訊息功能,以保障您的求職權益,職缺內容可能包含第三方通訊軟體,敬請謹慎評估。
職場安全提醒

適合你大展身手的工作

智能客服
您好,我是您的智能客服 找頭鹿有任何問題都可以問我喔!