Pulumi Launches New Infrastructure Libraries for GenAI Stack
February 21, 2024

Pulumi now offers native ways to manage Pinecone indexes, including its latest serverless indexes.

Pinecone is a serverless vector database with an easy-to-use API that allows developers to build and deploy high-performance AI applications. This is incredibly important as applications involving large language models, generative AI, and semantic search require a vector database to store and retrieve vector embeddings.

Pulumi also now has a template to launch and run LangChain’s LangServe in Amazon ECS, a container management service. This in addition to Pulumi’s existing support in running Next.js frontend applications in Vercel, managing Apache Spark clusters in Databricks and 150+ other cloud and SaaS services.

The GenAI tech stack is new and emerging but has typically consisted of a LLM service and a vector data store. Running this stack on a laptop is fairly simple but getting it to production is far harder. Most of this is done manually through a CLI or a web console, which introduces manual errors and repeatability problems that affect the security and reliability of the product.

Pulumi has made it easy to take a GenAI stack running locally and get it in production in the cloud with Pulumi AI, the fastest way to learn and build Infrastructure as Code (IaC). As GenAI complexity actually relates to cloud infrastructure provisioning and management, Pulumi is purpose built to manage this cloud complexity and is easy to use to support a new use case of AI.

Pulumi allows developers to tie together all the different pieces of infrastructure that goes into their GenAI product and manage it from a simple Python program.

Share this

Industry News

April 23, 2024

mabl announced the addition of mobile application testing to its platform.

April 23, 2024

Spectro Cloud announced the achievement of a new Amazon Web Services (AWS) Competency designation.

April 22, 2024

GitLab announced the general availability of GitLab Duo Chat.

April 18, 2024

SmartBear announced a new version of its API design and documentation tool, SwaggerHub, integrating Stoplight’s API open source tools.

April 18, 2024

Red Hat announced updates to Red Hat Trusted Software Supply Chain.

April 18, 2024

Tricentis announced the latest update to the company’s AI offerings with the launch of Tricentis Copilot, a suite of solutions leveraging generative AI to enhance productivity throughout the entire testing lifecycle.

April 17, 2024

CIQ launched fully supported, upstream stable kernels for Rocky Linux via the CIQ Enterprise Linux Platform, providing enhanced performance, hardware compatibility and security.

April 17, 2024

Redgate launched an enterprise version of its database monitoring tool, providing a range of new features to address the challenges of scale and complexity faced by larger organizations.

April 17, 2024

Snyk announced the expansion of its current partnership with Google Cloud to advance secure code generated by Google Cloud’s generative-AI-powered collaborator service, Gemini Code Assist.

April 16, 2024

Kong announced the commercial availability of Kong Konnect Dedicated Cloud Gateways on Amazon Web Services (AWS).

April 16, 2024

Pegasystems announced the general availability of Pega Infinity ’24.1™.

April 16, 2024

Sylabs announces the launch of a new certification focusing on the Singularity container platform.

April 15, 2024

OpenText™ announced Cloud Editions (CE) 24.2, including OpenText DevOps Cloud and OpenText™ DevOps Aviator.

April 15, 2024

Postman announced its acquisition of Orbit, the community growth platform for developer companies.

April 11, 2024

Check Point® Software Technologies Ltd. announced new email security features that enhance its Check Point Harmony Email & Collaboration portfolio: Patented unified quarantine, DMARC monitoring, archiving, and Smart Banners.