Databricks Secures Lilac AI Acquisition to Enhance LLM Training Data Quality
#Acquisition #AI #artificialintelligence #dataquality #Databricks #Gardentool #GenerativeAIapplications #LilacAI #llm #LLMtraining #machinelearning #Software
multiplatform.ai/databricks-sec…
We just released a new version of Lilac!
This release brings in Monaco (the VSCode engine) to render documents, with powerful context menus for searching, labeling concepts, and more.
We also added UI support for common ChatML formats like ShareGPT
Release notes: github.com/lilacai/lilac/…
A new version of Lilac is now released.
We now support:
- Loading custom embeddings
- Exporting directly to HuggingFace
Release notes: github.com/lilacai/lilac/…
In addition, I've worked with Nikhil Thorat and the Lilac team to cluster the dataset for analysis and further curation!
You can access the dataset in Lilac's Hugging Face Space here: lilacai-lilac.hf.space/datasets#lilac…
And can access the clusters by clicking this button:
OpenHermes-2.5 dataset is finally here!
We're hosting the full dataset with pre-computed Lilac clusters in our demo.
Clusters: lilacai-lilac.hf.space/datasets#lilac…
Explore the GAIR-lima fine-tuning dataset from the Mosaic blog post below using our Lilac Hugging Face Space!
lilacai-lilac.hf.space/datasets#lilac…
'Databricks acquires Lilac AI to enhance data quality for generative AI applications.' #Databricks #LilacAI #AI #DataManagement
infoworld.com/article/371468…
Alex Volkov (Thursd/AI) Teknium open sources Hermes dataset
Announcement: x.com/teknium1/statu…
Dataset: huggingface.co/datasets/tekni…
Lilac: lilacai-lilac.hf.space/datasets#lilac…
Lilac announces Garden - LLM powered clustering for datasets
Announcement: x.com/lilac_ai/statu…