Spaces:

somosnlp-hackathon-2022
/

Audio-Sentiment-Classifier

Running

App Files Files Community

DrishtiSharma commited on Apr 3, 2022

Commit

26a6277

•

1 Parent(s): f443f82

Update info.txt

Browse files

Files changed (1) hide show

info.txt +4 -1

info.txt CHANGED Viewed

@@ -5,4 +5,7 @@ Targeted SGDs: 1) Good health and well being 2) Peace, Justice and Strong Instit
 Potential Applications: Although this is a very small prototype, it can be scaled up to detect user's mood or current mental state. The audio libraries or other media in general can be presented, recommended, and categorized based on the recognized user's mood and preference. A mood lighting system, in addition to the aforementioned features, can be implemented to aid in user's health and overall wellbeing. Additionally, the model can be trained on data with more class labels in order to be useful in Peace, Justice scenarios, particularly in detecting brawls, and any other uneventful scenario. An audio classifier can be integrated in a surveillance system to detect brawls and other unsettling events that can be recognized using "sound."
 To begin with, we didn't have enough audio data in Spanish suitable for sentiment classification tasks. We had to make do with whatever data we could find in the MESD database because much of the material we came across was not open-source.
 The open-source MESD dataset was used to fine-tune the Wav2Vec2 base model, which contains ~1200 audio recordings, all of which were recorded in professional studios and were only one second long. Out of ~1200 audio recordings only 890 of the recordings were utilized for training. Due to these factors, the model and hence this Gradio application may not be able to perform well in noisy environments or audio with background music or noise. It's also worth mentioning that this model performs poorly when it comes to audio recordings from the class "Fear," which the model often misclassifies.
-My teammates were

 Potential Applications: Although this is a very small prototype, it can be scaled up to detect user's mood or current mental state. The audio libraries or other media in general can be presented, recommended, and categorized based on the recognized user's mood and preference. A mood lighting system, in addition to the aforementioned features, can be implemented to aid in user's health and overall wellbeing. Additionally, the model can be trained on data with more class labels in order to be useful in Peace, Justice scenarios, particularly in detecting brawls, and any other uneventful scenario. An audio classifier can be integrated in a surveillance system to detect brawls and other unsettling events that can be recognized using "sound."
 To begin with, we didn't have enough audio data in Spanish suitable for sentiment classification tasks. We had to make do with whatever data we could find in the MESD database because much of the material we came across was not open-source.
 The open-source MESD dataset was used to fine-tune the Wav2Vec2 base model, which contains ~1200 audio recordings, all of which were recorded in professional studios and were only one second long. Out of ~1200 audio recordings only 890 of the recordings were utilized for training. Due to these factors, the model and hence this Gradio application may not be able to perform well in noisy environments or audio with background music or noise. It's also worth mentioning that this model performs poorly when it comes to audio recordings from the class "Fear," which the model often misclassifies.
+<>
+<>
+<>
+Team