LLaVA-LoRA-nil-final-weights: Fine-Tuned Vision-Language Model Weights
Available on 1 platform
Description
A set of final model weights for the LLaVA (Large Language-and-Vision Assistant) model, fine-tuned using Low-Rank Adaptation (LoRA). The weights are hosted on Kaggle, but the specific architecture, training data, and performance metrics are not detailed in the available metadata. The dataset's author, organization, and last update date are unknown.
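For context on what LoRA weights usually contain: LoRA keeps a pretrained weight matrix W (d x k) frozen and learns a low-rank update BA, where B is d x r and A is r x k, so an adapter stores r(d + k) values per adapted matrix instead of d * k. The sketch below does only that arithmetic. The dimensions and rank are hypothetical illustrations, not values taken from this listing, and the helper name `lora_param_counts` is made up for the example.

```python
def lora_param_counts(d, k, r):
    """Parameter counts for one d x k weight matrix and its rank-r LoRA update."""
    full = d * k        # a dense update stores one value per weight
    lora = r * (d + k)  # B is d x r, A is r x k
    return full, lora


# Hypothetical sizes for illustration only; the actual rank and layer
# shapes of these weights are not documented in the listing.
full, lora = lora_param_counts(d=4096, k=4096, r=8)
print(full, lora, full // lora)  # 16777216 65536 256
```

Under these assumed sizes the adapter is 256 times smaller than a dense update for the same matrix, which is why LoRA checkpoints tend to be much smaller than full model checkpoints.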
Use Cases
Fine-tune a vision-language model for specific image captioning tasks (inferred from domain, verify after download)
Evaluate the performance of a LoRA-adapted LLaVA model on benchmark datasets (inferred from domain, verify after download)
Use as a starting point for research on efficient adaptation of large multimodal models (inferred from domain, verify after download)
Strengths
Published on Kaggle.
Limitations
Metadata is minimal; actual content requires verification after download.
File formats, file sizes, and checkpoint layout are unknown, which may limit suitability assessment; tabular properties such as row count and column definitions likely do not apply to model weights.
The weights may reflect biases present in the unspecified training data.
License is unknown; users must verify permissible use before downloading.
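Since the limitations above leave file formats and contents to be verified after download, a first pass could simply inventory the downloaded folder. The following is a minimal sketch using only the Python standard library. It assumes, without confirmation from the listing, that the weights might follow the common PEFT adapter layout with an `adapter_config.json` file; if no such file exists, the sketch just reports file types and total size. The function names `inventory` and `summarize` are hypothetical.

```python
import json
from collections import Counter
from pathlib import Path


def inventory(root):
    """Return (extension counts, total bytes, adapter config dict or None)."""
    ext_counts = Counter()
    total_bytes = 0
    adapter_config = None
    for path in sorted(Path(root).rglob("*")):
        if not path.is_file():
            continue
        ext_counts[path.suffix.lower() or "(none)"] += 1
        total_bytes += path.stat().st_size
        # Assumption: a PEFT-style LoRA adapter would ship this file.
        # The listing does not confirm that it is present.
        if path.name == "adapter_config.json" and adapter_config is None:
            try:
                adapter_config = json.loads(path.read_text(encoding="utf-8"))
            except (OSError, json.JSONDecodeError):
                adapter_config = None
    return dict(ext_counts), total_bytes, adapter_config


def summarize(root):
    """Print a short report about the files found under root."""
    ext_counts, total_bytes, config = inventory(root)
    print(f"Total size: {total_bytes / 1e6:.1f} MB")
    for ext, count in sorted(ext_counts.items()):
        print(f"  {ext}: {count} file(s)")
    if config is None:
        print("No readable adapter_config.json found.")
        return
    # Keys commonly written by PEFT; any of them may be absent here.
    for key in ("peft_type", "base_model_name_or_path", "r",
                "lora_alpha", "target_modules"):
        print(f"  {key}: {config.get(key, '(not set)')}")
```

Calling `summarize("path/to/downloaded/weights")` (a placeholder path) prints the report. The script reads only file metadata and one JSON file, so it does not load any weight files and is safe to run before deciding whether the weights are usable. License terms still need to be checked separately on the hosting page.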