NHLOCAL/gemini-subtitle-creator

NHLOCAL committed on Aug 10

Commit 9729033 (1 parent: 2ad4451)

Improvement: option to add a video file

Files changed (3)

README.md +1 -1
main.py +3 -3
templates/index.html +4 -4

README.md

@@ -15,7 +15,7 @@ app_port: 7860
 ### How to use?
 1.  **Enter an API key:** Paste your API key from Google AI Studio into the designated box. You can get a key [here](https://aistudio.google.com/app/apikey).
-2.  **Upload an audio file:** Click or drag an audio file in a supported format (such as MP3, WAV, M4A).
+2.  **Upload an audio or video file:** Click or drag an audio or video file in a supported format (such as MP3, MP4, WAV, M4A).
 3.  **Start transcription:** Click the "Start transcription" button and wait for the process to finish.
 4.  **Download results:** Once transcription is complete, you can view the result and download the SRT file.

main.py

@@ -234,9 +234,9 @@ async def _transcribe_and_stream(api_key: str, file_content: bytes, model_name:
         return json.dumps({"type": type, "message": message, "percent": percent, "data": data}) + "\n\n"
     try:
         system_prompt, pydantic_schema = load_system_prompt(), TranscriptionSegment
-        yield send_event("progress", "Processing the audio file...", 5)
+        yield send_event("progress", "Processing the media file...", 5)
         audio = AudioSegment.from_file(io.BytesIO(file_content))
-        yield send_event("progress", f"File length: {len(audio) / 60000:.1f} minutes. Splitting...", 15)
+        yield send_event("progress", f"Audio track length: {len(audio) / 60000:.1f} minutes. Splitting...", 15)
         chunks = await asyncio.to_thread(split_audio_webrtcvad, audio, MIN_SILENCE_LEN_MS)
         if not chunks: raise ValueError("No audio segments were created for processing.")
         chunk_info_messages = [f"{i+1}. {format_ms_to_srt_time(sum(len(c) for c in chunks[:i]))} - {format_ms_to_srt_time(sum(len(c) for c in chunks[:i+1]))}" for i in range(len(chunks))]
@@ -357,4 +357,4 @@ async def handle_transcription_stream(api_key: str = Form(...), model_name: str
     if not all([api_key, model_name, audio_file]):
         raise HTTPException(status_code=400, detail="Required form fields are missing.")
     file_content = await audio_file.read()
-    return StreamingResponse(_transcribe_and_stream(api_key, file_content, model_name, user_prompt), media_type="text/event-stream")
+    return StreamingResponse(_transcribe_and_stream(api_key, file_content, model_name, user_prompt), media_type="text/event-stream")
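
The progress messages in this hunk rely on two small conventions: each event is a JSON object followed by a blank line, and `len(audio)` on a pydub `AudioSegment` is a length in milliseconds, so dividing by 60000 gives minutes. The sketch below reproduces both in isolation. The full signature of `send_event` is not visible in the diff, so the parameter names and defaults here are assumptions, `duration_minutes` is a hypothetical helper, and the 150000 ms length is a made-up value.

```python
import json


def send_event(event_type, message, percent=None, data=None):
    """Serialize one progress event as JSON followed by a blank line.

    Mirrors the return statement shown in the hunk; the parameter
    names and defaults are assumptions, not the author's signature.
    """
    return json.dumps(
        {"type": event_type, "message": message, "percent": percent, "data": data}
    ) + "\n\n"


def duration_minutes(length_ms):
    """Convert a length in milliseconds to minutes (hypothetical helper)."""
    return length_ms / 60000


# Made-up length: 150000 ms stands in for len(audio).
event = send_event(
    "progress",
    f"Audio track length: {duration_minutes(150_000):.1f} minutes. Splitting...",
    15,
)
payload = json.loads(event)
print(payload["message"])  # Audio track length: 2.5 minutes. Splitting...
```

Because the body of each event is plain JSON, a client can parse it with any JSON decoder after splitting the stream on blank lines.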

templates/index.html

@@ -113,13 +113,13 @@
                     </div>
                     <div class="input-group">
-                        <label for="audio-file-input">Audio file (mp3, wav, m4a, etc.)</label>
+                        <label for="audio-file-input">Audio or video file (mp3, mp4, wav, m4a, etc.)</label>
                         <label for="audio-file-input" class="file-input-wrapper" id="audio-drop-zone">
                             <span class="material-symbols-outlined">upload_file</span>
                             <p>Click to choose a file or drag it here</p>
                             <div id="audio-file-name" class="file-name"></div>
                         </label>
-                        <input type="file" id="audio-file-input" accept="audio/*" required>
+                        <input type="file" id="audio-file-input" accept="audio/*,video/*" required>
                     </div>
                     <div class="buttons-container" style="margin-top: 2rem; flex-wrap: wrap; justify-content: center;">
@@ -271,13 +271,13 @@
                 if (audioFileInput.disabled) return;
                 audioDropZone.classList.remove('drag-over');
                 const droppedFile = e.dataTransfer.files[0];
-                if (droppedFile && droppedFile.type.startsWith('audio/')) {
+                if (droppedFile && (droppedFile.type.startsWith('audio/') || droppedFile.type.startsWith('video/'))) {
                     audioFile = droppedFile;
                     audioFileInput.files = e.dataTransfer.files;
                     audioFileNameEl.textContent = `File: ${audioFile.name}`;
                 } else {
                     audioFile = null;
-                    audioFileNameEl.textContent = 'Please select an audio file only';
+                    audioFileNameEl.textContent = 'Please select an audio or video file';
                 }
                 checkInputs();
             });
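
The drop handler now accepts any file whose MIME type starts with `audio/` or `video/`. The same prefix rule could be mirrored on the server as a cheap sanity check before decoding. The sketch below is only an illustration and is not part of this commit: `ACCEPTED_PREFIXES` and `is_accepted_media` are hypothetical names, and the content type is assumed to arrive as a plain string (or `None` when the client sends none).

```python
# Hypothetical server-side mirror of the client-side drop check.
ACCEPTED_PREFIXES = ("audio/", "video/")


def is_accepted_media(content_type):
    """Return True when the MIME type starts with 'audio/' or 'video/'.

    An empty or missing content type is rejected, matching the
    browser check, where an empty `type` fails both startsWith tests.
    """
    return bool(content_type) and content_type.startswith(ACCEPTED_PREFIXES)


for mime in ("audio/mpeg", "video/mp4", "image/png", ""):
    print(repr(mime), is_accepted_media(mime))
```

A client-declared MIME type can be wrong or missing, so a check like this only filters obvious mismatches; the decoding step remains the real test of whether the file contains usable audio.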