<b>Overview</b><br><p><span xml:lang="EN-US" data-contrast="none">We are the Azure AI Inferencing team, part of the CoreAI Foundry business that builds and runs the model-serving platform including, but not limited to, large OpenAI generative models. This team is at the forefront of delivering this innovation with massive scale powering every Azure OpenAI customer and scenario in the industry, both 3P and 1P customer like Bing and Office and solve exciting problems on the intersection of AI and Cloud.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240}"> </span></p><p><span xml:lang="EN-US" data-contrast="none">We are looking for a seasoned Software Engineer, who is passionate about designing and building highly reliable, available platform to support model inferencing at the scale of billions of requests per day. You will be working on high throughput/low latency scenarios and drive performance optimization capabilities.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240}"> </span></p><p>Microsoft is leading the AI strategy with an ambitious mission to democratize AI, make it an essential ingredient for delivering breakthrough customer experiences and to ensure the benefits of AI reach every person and organization on the planet, safely and responsibly. Our culture is centered on embracing a growth mindset and encouraging teams and leaders to bring their best each day. Join us and help shape the future of the world.</p><br><br><b>Responsibilities</b><br><p><strong><span xml:lang="EN-US" data-contrast="none">Responsibilities</span><span xml:lang="EN-US" data-contrast="none"> :</span></strong></p><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="1" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"hybridMultilevel"}" data-listid="11" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-US" data-contrast="none">Lead architecture and design of complex, distributed systems; make key technical decisions and mentor engineers on design tradeoffs and best practices.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="2" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"hybridMultilevel"}" data-listid="11" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-US" data-contrast="none">Own solution quality end</span>‑<span xml:lang="EN-US" data-contrast="none">to</span>‑<span xml:lang="EN-US" data-contrast="none">end, including test strategy, security testing, reliability, and operational readiness.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="3" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"hybridMultilevel"}" data-listid="11" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-US" data-contrast="none">Drive cross</span>‑<span xml:lang="EN-US" data-contrast="none">team collaboration, identifying dependencies, resolving conflicts, and aligning delivery plans across partner teams.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="4" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"hybridMultilevel"}" data-listid="11" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-US" data-contrast="none">Act as DRI for live systems, leading incident response, root</span>‑<span xml:lang="EN-US" data-contrast="none">cause analysis, and prevention through automation and operational improvements.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="5" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"hybridMultilevel"}" data-listid="11" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-US" data-contrast="none">Champion security, privacy, compliance, and Responsible AI, establishing security invariants, auditability, and monitoring across the system.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="6" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"hybridMultilevel"}" data-listid="11" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-US" data-contrast="none">Lead automation and deployment excellence, enabling scalable, zero</span>‑<span xml:lang="EN-US" data-contrast="none">touch deployments with safe rollout and rollback strategies.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="7" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"hybridMultilevel"}" data-listid="11" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-US" data-contrast="none">Raise engineering standards, through code reviews, production-grade telemetry, performance/scalability improvements, and mentoring others.</span></p></li></ul></div><br><br><b>Qualifications</b><br><div><p><strong><span xml:lang="EN-IN" data-contrast="none">Required Qualifications</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240}"> </span></strong></p></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="1" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="14" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">Bachelor’s or Master’s degree in Computer Science, Engineering, Mathematics, or a related field, </span><span xml:lang="EN-IN" data-contrast="none">or equivalent industry experience</span><span xml:lang="EN-IN" data-contrast="none">.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="2" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="14" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">15+ years of professional software development experience</span><span xml:lang="EN-IN" data-contrast="none">, building and operating complex systems.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="3" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="14" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">Strong foundations in </span><span xml:lang="EN-IN" data-contrast="none">computer science fundamentals</span><span xml:lang="EN-IN" data-contrast="none">, including algorithms, data structures, systems design, and coding proficiency.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="4" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="14" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">Proven experience in </span><span xml:lang="EN-IN" data-contrast="none">architecture and design of large-scale software systems</span><span xml:lang="EN-IN" data-contrast="none">, including making sound technical decisions and tradeoffs.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="5" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="14" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">Demonstrated expertise in </span><span xml:lang="EN-IN" data-contrast="none">software engineering lifecycle</span><span xml:lang="EN-IN" data-contrast="none">, including design, development, testing, quality assurance, deployment, and live-site operations.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="6" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="14" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">Strong </span><span xml:lang="EN-IN" data-contrast="none">problem-solving, systems thinking, and decision-making</span><span xml:lang="EN-IN" data-contrast="none"> skills with high attention to detail.</span></p></li><li role="listitem" data-aria-level="1" data-aria-posinset="6" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="14" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">Experience collaborating across teams, handling </span><span xml:lang="EN-IN" data-contrast="none">technical dependency management and conflict resolution</span><span xml:lang="EN-IN" data-contrast="none">.</span></p></li><li role="listitem" data-aria-level="1" data-aria-posinset="6" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="14" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">Excellent </span><span xml:lang="EN-IN" data-contrast="none">oral and written communication skills</span><span xml:lang="EN-IN" data-contrast="none"> in English, with the ability to clearly explain complex technical concepts.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"> </span></p></li></ul><div><p><strong><span xml:lang="EN-IN" data-contrast="none">Preferred Qualifications</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240}"> </span></strong></p></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="1" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="15" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">Experience operating </span><span xml:lang="EN-IN" data-contrast="none">real-time, high-throughput, low-latency services</span><span xml:lang="EN-IN" data-contrast="none"> in production environments.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="2" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="15" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">Hands-on experience designing, implementing, testing, and operating </span><span xml:lang="EN-IN" data-contrast="none">Azure AI or large-scale cloud services</span><span xml:lang="EN-IN" data-contrast="none">, meeting performance, scalability, reliability, and compliance requirements.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="3" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="15" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">Experience driving or contributing to </span><span xml:lang="EN-IN" data-contrast="none">engineering efficiency tools</span><span xml:lang="EN-IN" data-contrast="none"> or developer productivity improvements.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"> </span></p></li></ul></div><div><ul style="list-style-type: disc;" role="list"><li role="listitem" data-aria-level="1" data-aria-posinset="4" data-list-defn-props="{"335552541":1,"335559685":360,"335559991":360,"469769226":"Symbol","469769242":[8226],"469777803":"left","469777804":"","469777815":"multilevel"}" data-listid="15" data-font="Symbol" data-leveltext="" aria-setsize="-1"><p><span xml:lang="EN-IN" data-contrast="none">Exposure to security, compliance, and operational best practices for cloud-based or AI-driven services.</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"> </span></p></li></ul><p><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"><span xml:lang="EN-US" data-contrast="none">#AIPlatform #DistributedSystems #CloudScale #ModelServing</span><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240}"> </span> </span></p><p> </p><p><span data-ccp-props="{"201341983":0,"335559739":0,"335559740":240,"469777462":[720],"469777927":[0],"469777928":[8]}"><span data-teams="true"><strong>Other Requirements</strong>:Ability to meet Microsoft, customer and/or government security screening requirements are required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud background check upon hire/transfer and every two years thereafter. </span> </span></p></div></div> <br><p>This position will be open for a minimum of 5 days, with applications accepted on an ongoing basis until the position is filled.</p><br><hr><br><p>Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance with religious accommodations and/or a reasonable accommodation due to a disability during the application process, read more about <a href="https://careers.microsoft.com/v2/global/en/accessibility.html"><b><u>requesting accommodations.</u></b></a></p>