World's most capable AI for software development
Member of Engineering – Pre-training, Data Acquisition
Location
United States
Posted
23 days ago
Salary
0
Seniority
Senior
Job Description
Member of Engineering – Pre-training, Data Acquisition
poolside
• Design, build, and operate a large-scale web crawler responsible for acquiring all openly accessible data on the internet • Develop specialized deep crawlers targeting high-value sources to improve recall and coverage • In collaboration with data researchers, own a long-term road map for data acquisition • Build observability, monitoring, and debugging tooling to ensure reliability and transparency across crawl infrastructure • Collaborate with pre-training, post-training, and evaluations teams to align data acquisition priorities with model training needs • Build high-throughput ingestion pipelines for rapidly onboarding partner data and evaluating it for quality
Job Requirements
- Strong distributed systems background with proven experience building and operating large-scale infrastructure — data pipelines, web crawlers, or similar
- Proficiency in Python, and comfortable optimizing performance and debugging complex systems under production conditions
- Hands-on experience with web crawling or large-scale data extraction: understanding of HTTP protocols, distributed job queues, and data parsing at scale
- Familiarity with cloud platforms (AWS) and container orchestration (Kubernetes, Docker) for deploying and managing high-throughput workloads
- Awareness of the non-technical dimensions of internet-scale crawling: data privacy, robots.txt adherence, and responsible crawl practices
- Nice to have:
- Prior experience pre-training LLMs
- Experience in building trillion-scale SOTA pre-training datasets
- Experience translating research to production at scale
Benefits
- Fully remote work & flexible hours
- 37 days/year of vacation & holidays
- 16 weeks of flexible, full-pay parental leave
- Health insurance allowance for you & dependents
- Company-provided equipment
- Well-being, always-be-learning & home office allowances
- Frequent team get togethers
- Diverse & inclusive people-first culture
Related Guides
Related Job Pages
More Software Engineer Jobs
Senior NodeJS Developer
Hunt StWe help Aussie companies find top 3% remote talent in the Philippines & Nepal for a single finder's fee.
Role Description We are seeking a highly experienced Senior NodeJS Developer to join our team. In this role, you will be responsible for architecting and delivering scalable, complex backend systems that integrate with multiple services and handle sophisticated decision logic over time. You’ll work closely with product managers, frontend engineers, and infrastructure teams to build robust applications that are maintainable, secure, and performant. Some of the functionality is written in Python, therefore having strong Python development skills is highly advantageous. Key Responsibilities - Architect, design, and maintain complex backend services using NodeJS and TypeScript - Develop new features and enhance existing systems with a focus on scalability, reliability, and maintainability - Collaborate with frontend developers to integrate user-facing elements with backend APIs - Write efficient, high-quality, and well-tested code, with an emphasis on long-term maintainability - Optimise application performance, data flow, and resource usage across distributed systems - Lead and participate in code reviews; provide mentorship to junior developers - Contribute to architecture and design discussions, influencing technical direction - Demonstrate strong understanding of how applications operate within infrastructure (e.g., monitoring, fault tolerance, scaling) - Ensure application security and data protection best practices are embedded into all solutions - Debug, test, and troubleshoot issues across multiple integrated systems and platforms - Stay up to date with NodeJS/TypeScript ecosystem updates, tools, and best practices Qualifications - Bachelor’s degree in Computer Science, Information Technology, or related discipline (or equivalent professional experience) - 5+ years of hands-on professional experience in NodeJS backend development, building and owning production-grade systems, including complex system integrations and backend workflows - Strong experience with TypeScript, REST APIs, SQL databases, Git, and modern development workflows - 1+ years experience with Python custom backend development - Proven experience building systems that integrate with multiple external/internal services and handle complex logic/state over time - Experience designing and managing CI/CD pipelines for NodeJS applications (automated testing, secure deployments, rollback strategies) - Solid understanding of system-level concerns such as scalability, concurrency, security, and data integrity - Experience writing unit, integration, and end-to-end tests - Comfortable working in Linux environments - Strong communication skills and a proactive, collaborative approach - Ability to work independently, prioritise effectively, and own backend architecture Desirable Skills - Experience working with containerised environments (Docker, Kubernetes) - Experience with React (for end-to-end collaboration with frontend teams) - Experience with enterprise monitoring tools such as New Relic or Datadog - Familiarity with Databases stored procedures Work Arrangement & Expectations This is a remote role that will be set up as an independent contractor engagement. To ensure alignment and transparency, successful candidates will be expected to: - Disclose any existing ongoing roles or client work - Reflect this engagement on their LinkedIn profile (clearly marked as “Independent Contractor”)
Odoo Developer, Visión Funcional y de Negocio
AITInnovación y tecnología para impulsar el crecimiento empresarial. Juntos, construiremos el futuro digital de tu empresa
• Desarrollar y personalizar módulos en Odoo Community y Enterprise. • Participar en proyectos de implantación, evolución y mantenimiento de Odoo. • Analizar necesidades de cliente y traducirlas en soluciones técnicas viables. • Colaborar con perfiles funcionales y de gestión para definir mejoras y desarrollos a medida. • Realizar integraciones con sistemas externos mediante APIs, webhooks y conectores. • Participar en migraciones entre versiones de Odoo y en procesos de mejora continua. • Desarrollar soluciones orientadas a entornos de retail, e-commerce y operaciones comerciales. • Optimizar procesos relacionados con ventas, inventario, compras, logística, TPV y facturación. • Utilizar herramientas de IA para acelerar análisis, desarrollo, documentación, debugging y automatización de tareas. • Aportar criterio técnico y funcional, proponiendo soluciones en lugar de limitarse a ejecutar tickets. • Documentar desarrollos y mantener buenas prácticas de calidad, orden y escalabilidad.
Software Engineer I
InsightNow is the time to bring your expertise to Insight. We are not just a tech company; we are a people-first company. We believe that by unlocking the power of people and technology, we can accelerate transformation and achieve extraordinary results. Fortune 500 Solutions Integrator with deep expertise in cloud, data, AI, cybersecurity, and intelligent edge. Guiding organizations through complex digital decisions.
Role Description As a Cloud Engineer II you will: - Design and implement scalable, secure, and high-performing cloud solutions across Azure/AWS. - Lead infrastructure deployments using Infrastructure as Code (Terraform, Bicep, etc.). - Design and optimize CI/CD pipelines for automated and reliable deployments. - Implement and manage containerized workloads using Docker and Kubernetes (AKS/EKS). - Apply cloud security best practices (IAM, network security, encryption, governance). - Implement advanced monitoring, logging, and observability solutions. - Perform deep troubleshooting, performance optimization, and cost optimization. - Own end-to-end modules and drive technical delivery. Qualifications - 3-5 years of experience in cloud infrastructure, DevOps, or related roles. - Strong hands-on experience with Azure and/or AWS (multi-cloud is a plus). - Strong expertise in Azure: Azure Virtual Machines, VNet, Azure Storage, Azure Functions, Azure Monitor, Azure DNS, Azure AD, Load Balancers, Traffic Manager, and Application Gateway. - Strong expertise in AWS: EC2, VPC, S3, IAM, Route 53, CloudWatch, Elastic Load Balancer (ALB/NLB), Auto Scaling, and related networking services. Benefits - Freedom to work from another location—even an international destination—for up to 30 consecutive calendar days per year. Company Description Insight is an equal opportunity employer, and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, disability status, protected veteran status, sexual orientation or any other characteristic protected by law. At Insight, we celebrate diversity of skills and experience so even if you don’t feel like your skills are a perfect match - we still want to hear from you!
• Performs activities covering the entire software development lifecycle, from requirements gathering to supporting the final deployment of the features they developed, consistent with agile development processes. • Join the development team, taking on work items to be completed each Sprint; • Carry out development-related tasks such as analysis and design, programming, testing, and requirements management; • Manage their own work and promptly inform the team of any delays or other impediments; • Guide and support developers on the project development team; • Code and integrate software components according to technical specifications, using the project’s defined development tools, programming language, and libraries; • Provide support to the IT team and client users when requested regarding the characteristics and specifics of developed components, modules, and software packages; • Deliver training and knowledge transfer to the client on the developed software to ensure proper system operation; • Implement customizations by developing ABAP code or using the Webdynpro platform; • Program and develop software.



