Template to integrate LLMs as PTransform using langchain-beam [391914250]

Assigned

Feature Request

Status Update

No update yet.

Description

ga...@gmail.com

created issue #1

Jan 24, 2025 07:28AM

What you would like to accomplish:

- Leverage LLMs capabilities for data processing and create RAG based pipelines in dataflow and apache beam.
- Create vector embeddings for text directly within the pipeline.

How this might work:

langchain-beam java library is a apache beam and langchain integration. it provides transforms to integrate LLMs and embeddings models from gemini, openai and other providers in beam pipeline.

Goal is expose those expose those transform as template for easy integration in Dataflow.

repository link -

https://github.com/Ganeshsivakumar/langchain-beam

langchainbeam.png

72 KB

View

Download

Comments

va...@google.com <va...@google.com> Jan 28, 2025 06:23AM

Assigned to si...@google.com.

si...@google.com <si...@google.com> #2Jan 28, 2025 07:11AM

This feature request has been forwarded to the Data Fusion engineering team so that they may evaluate it. Note that there are no ETAs or guarantees of implementation for feature requests. All communication regarding this feature request is to be done here.

ga...@gmail.com <ga...@gmail.com> #3Jan 29, 2025 04:10PM

Struct support is being prioritized and can be tracked in open source by

https://issues.cask.co/browse/CDAP-15349.

si...@google.com <si...@google.com> #4Feb 4, 2025 07:15AM

Reassigned to gc...@google.com.

Hello,

Thank you for your response! This has been forwarded to the Cloud DataFlow Engineering Team so that they may evaluate it. Note that there are no ETAs or guarantees of implementation for feature requests. All communication regarding this feature request is to be done here.

Message last modified on Feb 4, 2025 07:18AM

Issue 391914250

Description

Issue summary

Comments

va...@google.com <va...@google.com> Jan 28, 2025 06:23AM

si...@google.com <si...@google.com> #2Jan 28, 2025 07:11AM

ga...@gmail.com <ga...@gmail.com> #3Jan 29, 2025 04:10PM

si...@google.com <si...@google.com> #4Feb 4, 2025 07:15AM

Add comment

Issue metadata