Home of workflow orchestration for data and ML at scale.
Thank you! Your submission has been received!
Oops! Something went wrong while submitting the form.
Union Team
•
August 26, 2025
AI Orchestration
LLMs
Newsletter
So you want to use SLMs. Can you really handle them?
SLMs help bridge the experiment-to-production gap, but only if you use them the right way.
Read the story→
Sage Elliott
•
August 22, 2025
Bioinformatics
Flyte 2.0
Fireside Chat: How Healthcare and Biotech Teams Build Secure, Compliant AI Infrastructure
Leaders from Artera AI and Union.ai came together to discuss how top healthcare and biotech teams build scalable, compliant AI infrastructure. Learn best practices in workflow orchestration in healthcare, data locality, security, and global AI deployment.
Read the story→
Samhita Alla
•
August 19, 2025
Vectors
S3 Vectors
Flyte 2.0
AWS
Amazon S3 Vectors Is Here. Flyte 2.0 Already Supports It.
Amazon S3 Vectors is the first cloud object store with native support for storing and querying vectors. That’s a big deal; no need for a separate vector database if you’re already on AWS.
Read the story→
David Espejo
•
August 12, 2025
Crusoe
Crusoe Cloud
AI Orchestration
Clean Compute
Union.ai + Crusoe Cloud Power Next-Gen AI Workloads
Modern AI workloads are growing in complexity, scale, and cost. As AI and ML teams build longer-running workflows and agents, AI orchestration becomes an essential tool in the implementation of their systems.
Read the story→
Ketan Umare
•
July 29, 2025
Press
Introducing Flyte 2.0: Dynamic, Crash-Proof, Resource-Aware AI Orchestration
This article charts the most pressing problems that orchestration can solve, and introduces the vision for Flyte 2.0. We’ll publish technical deepdives on Flyte’s capabilities over coming weeks.
Read the story→
Ketan Umare
•
July 2, 2025
AI Orchestration
Agents
Machine Learning
Unified AI Platform
The Evolution and Future of AI Orchestration
Let’s chart the history of orchestration and explore why it has become a non-negotiable for the future of AI and agents.
Read the story→
Samhita Alla
•
June 5, 2025
Integration
Serving
MLOps
Who Said RAG in Production Had to Be Hard? Not with Union.ai and W&B Weave
At Union.ai, our philosophy is simple: provide you with every tool needed to unlock maximum value from your AI infrastructure.
Read the story→
Samhita Alla
•
May 27, 2025
Integration
AI Workflows
Agents
Serving on Your Terms: Full-Stack LLM and RAG Observability with Arize + Union.ai
Bringing transparency and control to every stage of your model and app lifecycle.
Read the story→
David Espejo
•
May 15, 2025
Geospatial
Data
Plugins
Building Large-Scale Xarray Datasets for Geospatial Computing with Union.ai and Flyte
As geospatial machine learning engineers, we often face a common challenge.
Read the story→
Sage Elliott
•
April 25, 2025
AI Orchestration
AI Workflows
GPU Costs
Reusable Containers
Build Faster AI Pipelines with Union.ai Actors: Reuse Containers, Skip Cold Starts
When building real-world AI systems, performance isn’t just about how fast your models run when deployed — it's also about how efficiently your infrastructure supports them.
Read the story→
Union Team
•
April 7, 2025
Data Processing
Data Engineering
Company
Union.ai Earns Google Cloud Ready - BigQuery Designation to Strengthen AI Workflow Integrations
SEATTLE, WA— Union.ai, the developer suite for orchestrating, training, and serving AI models, today announced that it has achieved the Google Cloud Ready - BigQuery Designation.
Read the story→
Sage Elliott
•
March 6, 2025
LLMs
LLM Observability: A Fireside Chat with John from Arize
In this fireside chat, we talk with Arize’s Head of Developer Relations John Gilhuly about LLM Observability: Tracing, Evaluations, and Real-Time Insights.
Read the story→
Sage Elliott
•
March 3, 2025
Machine Learning
MLOps
AI Orchestration
Reproducible Workflows for Compound AI: Reliable and Scalable AI Development
In AI and machine learning, the need for reproducibility is essential for ensuring reliability, transparency, and trustworthiness of models and experiments.
Read the story→
Samhita Alla
•
February 24, 2025
Machine Learning
Model Deployment
AI Workflows
Building Production-Ready Compound AI Applications Just Got a Lot Easier: A RAG Example
Imagine if building compound AI apps was as easy as piecing together Legos. At Union, we're abstracting the infrastructure layer to make this a reality.
Read the story→
David Espejo
•
February 24, 2025
Security
Building Secure AI Systems with Union's Defense-in-Depth Approach
As machine learning systems become deeply embedded in software infrastructure, their security vulnerabilities pose critical risks.
Read the story→
Ketan Umare
•
January 21, 2025
Newsletter
Serving
Unified AI Platform
Union’s January Newsletter
Read on for new features, upcoming events, and job openings.
Read the story→
David Espejo
•
January 14, 2025
Integration
Data Engineering
Notebooks
Jupyter Notebooks and Union: Automatically Accelerate the Time-to-Value of ML Models
In the fast-paced world of machine learning, data scientists and researchers are constantly battling infrastructure challenges that slow down innovation and drain organizational resources.
Read the story→
Pryce Turner
•
January 8, 2025
Bioinformatics
AI Orchestration
AI Workflows
Protein Folding: An Example of Bioinformatics with Union
Protein folding has profound implications for understanding disease, drug development, and fundamental life processes.
Read the story→
Ketan Umare
•
December 18, 2024
Unified AI Platform
Serving
Debugging
Union’s December Newsletter
Read on for a re:Invent recap, new features, upcoming events, and job openings.
Read the story→
Sage Elliott
•
December 11, 2024
Unified AI Platform
Cost Observability
GPU Costs
Cost Observability for AI: Transparency That Lowers Costs
Building scalable machine learning projects is challenging enough without the added burden of hidden or unpredictable infrastructure costs.
Read the story→
Sage Elliott
•
December 4, 2024
Unified AI Platform
Reusable Containers
AI Workflows
Actors: Faster, Cheaper AI Workflows with Stateful Containers
AI workflows often use containers as isolated environments per task, but each spin-up requires initializing dependencies, configuring models, and loading data, slowing down workflows that need rapid, repeated executions.
Read the story→
Sage Elliott
•
December 2, 2024
Unified AI Platform
Inference
AI Workflows
Union: The Unified AI Platform
Developing and deploying AI models at scale is challenging. Many teams face obstacles like disconnected workflows, runaway or prohibitive infrastructure costs, and slow time-to-market for AI solutions.
Read the story→
Sage Elliott
•
November 21, 2024
NVIDIA
Company
Events
Join Union at AWS re:Invent 2024!
Visit Union in the re:Invent Expo Hall as part of the NVIDIA booth #1620.
Read the story→
Niels Bantilan
•
November 4, 2024
Data Quality
Open-source
Pandera Brings Code Coverage Standards for Data Quality in AI
Reaching the amazing milestone of 50 million downloads, we asked Niels Bantilan, Pandera’s creator, to reflect on the journey of getting here and what might be next.
Read the story→
Niels Bantilan
•
October 24, 2024
AI Orchestration
GPUs
LLMs
MLOps
Reproducing Liger Kernel Benchmarks on Phi3 Mini
Reproducibility is the cornerstone of science and engineering: Without it, we can’t reliably build on top of each other’s findings and work.
Read the story→
Samhita Alla
•
October 15, 2024
Serverless
LLMs
Model Training
Model Deployment
Building an iOS App to Serve a Fine-Tuned Llama Model with Union and MLC-LLM
So, I had this idea the other day: What if I could get ChatGPT-like AI to run on my iPhone and speak to me in my native language, Telugu? Not through some API or cloud service, but actually running directly on my phone.
Read the story→
Samhita Alla
•
October 7, 2024
LLMs
Model Training
Model Deployment
Inference
Serve Fine-tuned LLMs with Ollama
Ollama is a platform designed to simplify running open-source large language models (LLMs) locally.
Read the story→
David Espejo
•
October 1, 2024
Community
Flyte at Hacktoberfest 2024
Flyte is back again for another exciting round at this year’s Hacktoberfest!
Read the story→
Ketan Umare
•
September 17, 2024
Company
Press
Union Welcomes David Jakubowski as President
I’m excited to announce that David Jakubowski has joined Union as President.
Read the story→
Sage Elliott
•
September 17, 2024
Computer Vision
Podcast
Machine Learning
Fireside Chat with Leo Dirac: The Future of Computer Vision and Robotics
In this fireside chat, we talk with Groundlight co-founder and CTO Leo Dirac about the future of computer vision and robotics.
Read the story→
Samhita Alla
•
August 20, 2024
NVIDIA
AI Orchestration
Union Powers Faster End-to-End AI Application Deployment using NVIDIA NIM
At Union, we understand the complexities of deploying generative AI in production.
Read the story→
Thomas Fan
•
August 19, 2024
Machine Learning
AI Orchestration
Comet Integration with Union & Flyte
In the machine learning (ML) and artificial intelligence (AI) domain, managing, tracking, and visualizing model training processes, especially at scale, is a significant challenge.
Read the story→
Thomas Fan
•
August 19, 2024
Machine Learning
AI Orchestration
Flyte and Weights & Biases Integration
With Flyte’s latest plugin for Weights & Biases, you can now effectively run Machine Learning or AI workflows on Union and integrate with Weights & Biases capabilities.
Read the story→
David Espejo
•
August 13, 2024
Community
Flyte on Azure: A Reference Implementation
It’s been a journey. There have been users successfully running Flyte on Azure but, unfortunately, nothing publicly documented.
Read the story→
Katrina Rogan
•
August 1, 2024
Serverless
NVIDIA
GPUs
Union Serverless Broadens Support for NVIDIA Accelerated Computing
We are excited to announce that Union Serverless users can now harness the power of NVIDIA A100 Tensor Core GPUs and NVIDIA L4 Tensor Core GPUs.
Read the story→
Pryce Turner
•
July 31, 2024
Bioinformatics
NVIDIA Parabricks on Flyte: Orchestrating Accelerated Bioinformatics
NVIDIA Parabricks is a software suite that accelerates genomic sequence analysis by reimplementing industry standard tools to use the parallel...
Read the story→
Niels Bantilan
•
July 29, 2024
Data Quality
Pandera 0.20.0: Pyarrow Data Type Support
Pandera v0.20.* now supports Pyarrow data types in the pandas validation engine 🚀.
Read the story→
Pryce Turner
•
July 10, 2024
Accelerated Datasets
Reduce the Runtime & Memory Requirements of your Workloads by more than 50% with Accelerated Datasets
Bioinformaticians and ML engineers often need to balance performance versus cost when running data-intensive workloads.
Read the story→
Haytham Abuelfutuh
•
June 28, 2024
Serverless
Introducing Union Serverless
Orchestrate your AI, with serverless execution and pay-as-you-go pricing.
Read the story→
Samhita Alla
•
June 18, 2024
Agents
Cut OpenAI Batch Pipeline Costs by Over 50% with Zero Container Overhead
OpenAI real-time client is ideal for real-time responses, especially for use cases and needs where price is not a factor...
Read the story→
Niels Bantilan
•
June 3, 2024
Artifacts
Data-aware, Event-driven AI Orchestration with Artifacts
Artifacts provide a core abstraction that serves as an interface between the different teams that work together to build AI-powered products.
Read the story→
Haytham Abuelfutuh
•
May 29, 2024
Article
Scaling patterns for batch workloads on K8s
Batch workloads often involve large data processing and dependency on other batch workloads.
Read the story→
Samhita Alla
•
May 23, 2024
Model Deployment
AWS
Union unveils a powerful model deployment stack built with AWS Sagemaker & NVIDIA Triton inference server
Machine learning engineering is a complex process, spanning data procurement and processing, model training, deployment, and scaling.
Read the story→
Kevin Su
•
May 9, 2024
Article
Faster Airflow to Flyte migration powered by Flyte Airflow Agents
We have had the privilege of seeing data teams experience the value of a unified platform for both machine learning and data pipelines.
Read the story→
Niels Bantilan
•
May 7, 2024
Data Quality
Pandera 0.19.0: Polars DataFrame Validation
The day is finally here! Pandera 0.19.0 ships support for Polars.
Read the story→
Sage Elliott
•
May 2, 2024
MLOps
Community
Podcast
The Essential Role of Vector Databases in LLMOps
Zilliz provides enterprise-grade AI technologies, including one of the world’s most popular open-source vector databases, Milvus.
Read the story→
Samhita Alla
•
May 1, 2024
Article
Open-Source Video Dubbing Using Whisper, M2M, Coqui XTTS, and Sad Talker
AI video dubbing or translation has surged in popularity, breaking down language barriers and enabling communication across diverse cultures.
Read the story→
Thomas Fan
•
April 29, 2024
Article
Performance Tuning AI Models with NVIDIA DGX Cloud
As generative AI models become more capable and deployed in various contexts, optimizing the model in terms of throughput and memory consumption...
Read the story→
John Votta
•
April 16, 2024
Article
Move Fast and Don’t Break Things: Introducing Artifacts Lineage and Reactive Workflows
Today’s AI development lifecycle is marked by multiple teams collaborating to create AI-driven products.
Read the story→
Jason Porter
•
April 15, 2024
Feature
A Union-Inspired FlyteConsole
Here at Union, we’ve been hard at work enhancing the FlyteConsole.
Read the story→
Sage Elliott
•
April 9, 2024
Community
Reflections from NVIDIA GTC 2024: Innovations, Insights, and the Future Unveiled
After an electrifying NVIDIA GTC 2024, I’ve been back in Seattle going through my notes, photos, and a whirlwind of memories!
Read the story→
Samhita Alla
•
April 9, 2024
Article
Deploy Segment Anything Model (SAM) for Inference on Amazon SageMaker
Explore an end-to-end workflow: fine-tuning SAM, batch prediction, user approval, and deployment on SageMaker
Read the story→
Kevin Su
•
April 3, 2024
Feature
Flyte Agents: A Developer Perspective
In modern data-driven and machine-learning workflows, efficient orchestration & execution of tasks are crucial in achieving productivity & scalability
Read the story→
Union Team
•
March 26, 2024
Press
Union Joins NVIDIA Inception to Accelerate AI Innovation
Collaboration underscores Union's commitment to democratize AI application development
Read the story→
Haytham Abuelfutuh
•
March 21, 2024
Article
Need for multi-cloud and multi-cluster for AI
Traditionally, organizations have selected a single computing provider on which to base their operations.
Read the story→
Niels Bantilan
•
March 20, 2024
Data Quality
Pandera 0.18: Global and granular validation controls
Pandera 0.18 introduces two new configuration settings that control how validation happens.
Read the story→
Ketan Umare
•
March 12, 2024
Feature
Flyte Agents Framework
Flyte is a workflow orchestrator that unifies machine learning, data engineering, & data analytics stacks for building robust & reliable applications.
Read the story→
John Votta
•
March 7, 2024
Article
Achieving Reproducible Workflows with Flyte
This is part 3 of a blog series on the need for end-to-end reproducibility in AI development.
Read the story→
Sage Elliott
•
March 4, 2024
Events
Company
Union.ai at NVIDIA GTC 2024: Building AI Together
NVIDIA's GTC 2024 is just around the corner, and tech enthusiasts, industry leaders, and innovators worldwide are gearing up!
Read the story→
John Votta
•
February 29, 2024
MLOps
Towards a Reproducible AI Solution
In a previous post we highlighted the need for reproducibility in AI development, and explored some of the challenges of achieving this...
Read the story→
Shalabh Chaudhri
•
February 26, 2024
Features
Cloud
Union is now available on Google Cloud Platform Marketplace!
We’re excited to announce that the Union platform is now available on the Google Cloud Platform (GCP) Marketplace.
Read the story→
John Votta
•
February 21, 2024
MLOps
AI Orchestration
Why AI Reproducibility is Hard
The recent explosion of interest in AI has sparked a surge of investment aimed at bringing the technology into virtually every consumer and...
Read the story→
John Votta
•
February 15, 2024
Features
Iterating at Scale with Interactive Tasks in Union
Developing AI workflows locally, then deploying to remote environments can lead to problems and slow downs.
Read the story→
Deva DeDios
•
January 8, 2024
Community
Flyte Weekly Community Raffle: Let’s Soar to New Heights
Hello, Flyte community! The Union team behind Flyte is thrilled to announce the Flyte Community Raffle!
Read the story→
Sage Elliott
•
January 5, 2024
LLMs
Computer Vision
Podcast
How LLMs Are Transforming Computer Vision
Voxel51 is the data-centric machine learning software company behind FiftyOne, an open-source toolkit for building computer vision workflows.
Read the story→
David Espejo
•
January 3, 2024
AI Orchestration
Cloud
Flyte for GCP: A Platform Engineer’s Overview
Flyte is an AI orchestration platform for rapid Data, ML, & AI development, particularly useful for scaling pipelines in production environments.
Read the story→
Pryce Turner
•
December 21, 2023
Company
Community
Unleashing the Power of Community: An Engineer’s Journey from Open Source to Full-Time Engagement
Pryce recently joined the Union team as a Solutions Architect.
Read the story→
Sara Gawlinski
•
December 21, 2023
Features
Cloud
Union: Now available on AWS Marketplace!
We’re excited to open the Union platform to AWS developers as it is now available in the AWS Marketplace.
Read the story→
Pryce Turner
•
December 6, 2023
Bioinformatics
MLOps
Human-in-the-Loop Pipelines
By walking through a genomic alignments code and a Streamlit app, explore how Union makes it easier to connect external inputs to pipelines.
Read the story→
David Espejo
•
October 24, 2023
Company
Community
What have I learned about Developer Relations, Open Source, and AI at Union.ai?
What’s It Like Working at Union.ai?
Read the story→
Sara Gawlinski
•
October 24, 2023
LLMs
Model Training
Fine-Tuning LLMs On-Prem with Union and Platform9 at KubeCon 2023
It’s KubeCon season, and cloud-native enthusiasts will flock to Chicago November 6-9 to hear the latest innovations to all things Kubernetes.
Read the story→
Niels Bantilan
•
October 18, 2023
Data Quality
Pandera 0.17 Adds Support for Pydantic v2
I’m super excited to announce that Pandera now supports pydantic v2! 🎉
Read the story→
Deva DeDios
•
October 11, 2023
Events
Union at MLOps World Conference 2023
Union.ai is thrilled to attend and sponsor the MLOps World Conference for the second year in a row!
Read the story→
Sara Gawlinski
•
October 5, 2023
Machine Learning
Bioinformatics
Events
Sequences and Systems: The Convergence of Machine Learning and Biotech
When NGS applied the power of massively parallel processing to the analysis of DNA, it transformed the biotech playing field.
Read the story→
James Sutton
•
September 18, 2023
MLOps
AI Orchestration
GPUs in MLOps: Optimization, Pitfalls, and Management
Graphics Processing Units have roots deep in gaming, but they’re serious business when it comes to machine learning.
Read the story→
Samhita Alla
•
September 13, 2023
LLMs
Data Processing
Data Quality
Fine-Tuning Insights: Using LLMs as Preprocessors to Improve Dataset Quality
LLMs for data cleaning: Yay or nay?
Read the story→
Matthew Rothenberg
•
September 11, 2023
LLMs
Dollars to Data: Kelsey Hightower Talks LLMs in the Enterprise
Kubernetes guru Kelsey Hightower spoke with Union.ai about the business case for LLMs and AI and their potential to reshape the workplace.
Read the story→
Niels Bantilan
•
August 30, 2023
LLMs
Model Training
Fine-tune Llama 2 with Limited Resources
Do more with less: Refine the 70 billion parameter Llama 2 model on your dataset with a bunch of T4s
Read the story→
Matthew Rothenberg
•
August 24, 2023
LLMs
LLMs in the enterprise — what’s chat got to do with it?
Large Language Models (LLMs) are this season’s hot technology topic, but what will it take to establish their lasting success in the enterprise?
Read the story→
Samhita Alla
•
August 21, 2023
LLMs
Model Training
Prompt Engineering
Large Language Models in Production
A 5-minute cheatsheet outlining the challenges and solutions for building Large Language Models for production.
Read the story→
David Espejo
•
August 10, 2023
Machine Learning
AI Orchestration
Good Systems Gone Bad: The Art of Performing at Scale
Friday afternoon, you’re browsing the website of your usual event-ticketing portal trying to find something interesting to do this weekend.
Read the story→
Sara Gawlinski
•
August 1, 2023
AI Orchestration
Introducing the Modern AI Orchestration Resource Hub
Artificial intelligence pipelines are infrastructure intensive. They may entail machine- or deep-learning models that are more iterative...
Read the story→
Niels Bantilan
•
July 28, 2023
Data Quality
Pandera 0.16: Going Beyond Pandas Data Validation
I’m super excited to announce the availability of Pandera 0.16! This release features a suite of improvements and bug fixes.
Read the story→
Samhita Alla
•
July 4, 2023
LLMs
Model Training
Fine-Tuning Insights: Lessons from Experimenting with RedPajama Large Language Model on Flyte Slack Data
Large language models (LLMs) have taken the world by storm, revolutionizing our understanding and generation of human-like text.
Read the story→
Deva DeDios
•
June 8, 2023
Events
Union at Toronto Machine Learning Summit
Union.ai is excited to share that we will be attending and sponsoring the Toronto Machine Learning Summit conference happening from June 12-13.
Read the story→
Samhita Alla
•
May 23, 2023
LLMs
Inference
AI Orchestration
Parallel Audio Transcription: Using Whisper, JAX and Flyte Map Tasks for Streamlined Batch Inference
Imagine you have a dataset of thousands of audio files that need to be transcribed.
Read the story→
Matthew Rothenberg
•
May 19, 2023
Company
Open-source
Manish Patel: ‘Now is the exact right moment for Union’
The founder of VC Nava Ventures describes the big vision he sees for Union.ai.
Read the story→
Martin Stein
•
May 17, 2023
Company
Union.ai announces $19.1M Series A and launches all-new Union Cloud to simplify AI and data workflows for all
Union Cloud enables any organization to operationalize complex AI with data ownership, governance and cost efficiency.
Read the story→
Niels Bantilan
•
May 15, 2023
LLMs
Model Training
Prompt Engineering
Fine Tuning vs. Prompt Engineering Large Language Models
When to manipulate the input prompt and when to roll up your sleeves and update parameter weights.
Read the story→
Sara Gawlinski
•
May 3, 2023
Events
Company
PyData Seattle 2023 In Review
The Union.ai team kept busy last week at PyData Seattle 2023.
Read the story→
David Espejo
•
May 3, 2023
Company
Union Cloud is SOC 2 Type II Certified
We’re excited to announce that Union Cloud has achieved the SOC2 Type II Certification for Security, Availability and Integrity.
Read the story→
Sara Gawlinski
•
May 3, 2023
Events
Union at PyData Seattle 2023
Every year, PyData Seattle brings together experts, practitioners, and enthusiasts in the field of data science and machine learning.
Read the story→
Sandra Youssef
•
February 28, 2023
Company
Meet Union.ai at Gartner
The World’s Most Important Gathering of Data and Analytics Leaders
Read the story→
David Espejo
•
February 14, 2023
Events
Cloud
Cloud Native Security Con 2023: highlights
I had the chance to attend Cloud Native Security Con North America 2023, which ran Feb. 1 and 2 at the Seattle Convention Center.
Read the story→
Niels Bantilan
•
February 1, 2023
Machine Learning
The ML Doctor Says: Don’t Build Fancy Models Before You Set a Simple Baseline
So you’ve taken a few online courses in machine learning and landed a data scientist role in your first industry job.
Read the story→
Sharon Florentine
•
January 10, 2023
Company
MLOps
Union’s Andrew Dye describes his journey to MLOps
Union.ai Systems Engineer Andrew Dye sat down with Demetrios Brinkmann and David Aponte of MLOps Community for an MLOps Coffee Session.
Read the story→
Niels Bantilan
•
December 20, 2022
Data Engineering
Data Processing
Structured Dataset
Tabular data is ubiquitous today and nowhere more so than in data-engineering and data science tasks, where inputs and outputs are typically...
Read the story→
Niels Bantilan
•
December 7, 2022
Features
Machine Learning
UnionML 0.2.0 Integrates with BentoML
One of the most challenging aspects of building machine learning-driven applications is what I like to call “the deployment chasm.”
Read the story→
Niels Bantilan
•
November 28, 2022
Data Quality
Pandera: A Statistical Data Testing Framework in Python
I first learned about DataFrames very early in my data-science journey, learning Python and R in tandem.