Developer Day
Apr 17, 2025
3:00 pm
-
3:50 pm

Achieving Seamless Ai-augmented Software Development with Automated Etl+ for Genai

Add to Calendar Apr 17, 2025 Apr 17, 2025 America/Los_Angeles Achieving Seamless Ai-augmented Software Development with Automated Etl+ for Genai This workshop will demonstrate how to leverage Unstructured.io's fully automated ETL+ solution to streamline data processing for AI-driven software development. We start with ETL for GenAI: continuously harvesting newly generated unstructured data from systems of record, transforming it into LLM-ready formats using our optimized, pre-built pipelines, and writing it to downstream locations such as vector or graph databases. Then we add the +: all the features enterprise users need to eliminate ETL headaches and stay focused on applications that drive their business forward. Participants will gain practical insights into using the brand new Playground feature in the Unstructured API: transforming a local unstructured file into partitioned, chunked, enriched, and embedded json, while being able to visualize and understand each step in the pipeline. Participants will also learn about all of Unstructured's offerings: seamlessly integrated third-party services, incremental vector syncing, contextual chunking, ane more. We will also review how to fully customize your ETL pipelines and deploy them within existing infrastructure environments, either with a no code UI or via API. Attendees will leave with actionable strategies to address the complex challenges of handling diverse data types and sources in AI applications. Workshop 2 | Gateway Pavilion - Pier 2 | 2 Marina Blvd, San Francisco, CA 94123

About this session

This workshop will demonstrate how to leverage Unstructured.io's fully automated ETL+ solution to streamline data processing for AI-driven software development. We start with ETL for GenAI: continuously harvesting newly generated unstructured data from systems of record, transforming it into LLM-ready formats using our optimized, pre-built pipelines, and writing it to downstream locations such as vector or graph databases. Then we add the +: all the features enterprise users need to eliminate ETL headaches and stay focused on applications that drive their business forward. Participants will gain practical insights into using the brand new Playground feature in the Unstructured API: transforming a local unstructured file into partitioned, chunked, enriched, and embedded json, while being able to visualize and understand each step in the pipeline. Participants will also learn about all of Unstructured's offerings: seamlessly integrated third-party services, incremental vector syncing, contextual chunking, ane more. We will also review how to fully customize your ETL pipelines and deploy them within existing infrastructure environments, either with a no code UI or via API. Attendees will leave with actionable strategies to address the complex challenges of handling diverse data types and sources in AI applications.

Session Speaker

Nina Lopatina

Staff Developer Relations Engineer
Unstructured

More sessions

Developer Day
Apr 17, 2025
9:40 am

The Full Stack of Open Generative Ai

Join an AI expert from Meta for an in depth look at the latest advancements in the open generative AI stack. This session will cover from the metal to the agent how large scale AI systems are built, the tools used and how you can build your own with open source AI from Meta including PyTorch, Triton, Llama and more.

Developer Day
Apr 17, 2025
10:00 am

Ensuring Responsible AI with Advanced Model Evaluation

This session will cover Sama's comprehensive Model Evaluation services, emphasizing the importance of aligning AI systems with ethical guidelines and improving their performance. Attendees will learn how to systematically assess AI outputs, identify inaccuracies and vulnerabilities, and employ strategies for accelerated time-to-market with robust model validation. Leveraging Sama's expert-driven platform, participants will discover how to build reliable, high-performing generative AI models while adhering to industry standards.

Developer Day
Apr 17, 2025
10:20 am

Building Agents to Automate Knowledge Work

A huge promise of LLM-powered agents is to not only provide simple Q&A search interfaces, but to automate 50%+ of operational work that a knowledge worker typically does. A lot of the day-to-day of a knowledge worker - whether an investment analyst, customer agent, or engineer - is ingesting a large volume of unstructured data and using that to synthesize new insights or take actions. This talk walks through the e2e architecture for building production knowledge agents that can actually do useful work over your data. This requires both a production-quality knowledge management layer as well as agent architectures that can reliably solve certain use cases.