{
  "text": "Imagine the wildest idea that you've ever had, and you're curious about how it might scale to something that's a 100, a 1,000 times bigger."
}

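from openai import OpenAI

# The client reads the API key from the OPENAI_API_KEY environment variable
client = OpenAI()

# Open the audio in binary mode and transcribe it in its original language
audio_file = open("data/audio.m4a", "rb")
transcript = client.audio.transcriptions.create(
  model="whisper-1",
  file=audio_file
)
print(transcript.text)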

Sanat tarihiyle ilgilenen veya ilgilenmeyen herkesin kulağına mutlaka Rönesans kelimesi çalınmış olmalı. Duyar duymaz aklımıza bilim, sanat ve yenilikler gelir. Ancak bu tanımlama aslında dini bir terim olarak ortaya çıkmıştır. Kişinin yeniden hayata dönüşüne verilen isimdir Rönesans. Rönesans günümüzde kullanılan haliyle ilk kez 1860 yılında yazarı Jacob Burckhardt'ın kendi çabalarıyla yayımladığı İtalya'da Rönesans Kültürü adlı kitabında karşımıza çıkar. Şimdi daha geriye gidelim ve sizi sıkıcı bir takım tanımlamalardan uzaklaştıralım. Nedir bu Rönesans, ortaya çıkmasının sebepleri nedir ve nelere sebep olmuştur? Rönesans sanatını anlamak için öncelikle Ortaçağ'ı bilmemiz ve anlamamız gerekiyor. Rönesans öncesinde Ortaçağ, Carol Lange, Romanesque ve Gotik gibi sanat takımları hakimdi. Ancak şimdilik bu üç dönemi detaylandırmak yerine genel olarak Ortaçağ'ın karanlığından bahsetmek daha yerinde olacaktır.

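# With response_format="text" the call returns a plain string rather than an object,
# so print the result directly instead of accessing .text
transcript = client.audio.transcriptions.create(
  model="whisper-1",
  file=audio_file,
  response_format="text"
)
print(transcript)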

Sanat tarihiyle ilgilenen veya ilgilenmeyen herkesin kulağına mutlaka Rönesans kelimesi çalınmış olmalı. Duyar duymaz aklımıza bilim, sanat ve yenilikler gelir. Ancak bu tanımlama aslında dini bir terim olarak ortaya çıkmıştır. Kişinin yeniden hayata dönüşüne verilen isimdir Rönesans. Rönesans günümüzde kullanılan haliyle ilk kez 1860 yılında yazarı Jacob Burckhardt'ın kendi çabalarıyla yayımladığı İtalya'da Rönesans Kültürü adlı kitabında karşımıza çıkar. Şimdi daha geriye gidelim ve sizi sıkıcı bir takım tanımlamalardan uzaklaştıralım. Nedir bu Rönesans, ortaya çıkmasının sebepleri nedir ve nelere sebep olmuştur? Rönesans sanatını anlamak için öncelikle Ortaçağ'ı bilmemiz ve anlamamız gerekiyor. Rönesans öncesinde Ortaçağ, Carol Lange, Romanesque ve Gotik gibi sanat takımları hakimdi. Ancak şimdilik bu üç dönemi detaylandırmak yerine genel olarak Ortaçağ'ın karanlığından bahsetmek daha yerinde olacaktır.

# Translate the same audio; the translations endpoint outputs English text
translation = client.audio.translations.create(
  model="whisper-1",
  file=audio_file,
  temperature=0.5
)
print(translation.text)

Everyone who is interested in art history must have heard the word Renaissance. As soon as we hear it, we think of science, art, and innovation. However, this definition has actually emerged as a religious term. Renaissance is the name given to the person's return to life. Renaissance is used today, for the first time in 1860, writer Jacob Burckhardt published it with his own efforts. In his book called Renaissance Culture in Italy, Now let's go back and remove you from boring definitions. What is this Renaissance? What are the reasons for its emergence and what has caused it? To understand the Renaissance art, we must first know and understand the Middle Ages. Before the Renaissance, the Middle Ages The art teams such as Carolingian, Romanesque and Gothic dominated. However, instead of detailing these 3 periods for now, it would be better to talk about the darkness of the Middle Ages.

from pydub import AudioSegment

large_audio_file = "data/large_audio.mp3"

song = AudioSegment.from_file(large_audio_file)

# PyDub handles time in milliseconds
two_minutes = 2 * 60 * 1000

first_2_minutes = song[:two_minutes]

first_2_minutes.export("output/large_audio_1.mp3", format="mp3")

<_io.BufferedRandom name='output/large_audio_1.mp3'>
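
The cell above keeps only the first two minutes. To transcribe a whole recording that exceeds the Audio API's 25 MB upload limit, the same slicing can be repeated over the full duration. The sketch below is a minimal illustration and not part of the original notebook: `chunk_ranges` and `split_into_chunks` are hypothetical helper names. It relies only on `len()` and slicing, which pydub's `AudioSegment` provides in milliseconds.

```python
from typing import List, Sequence, Tuple


def chunk_ranges(total_ms: int, chunk_ms: int) -> List[Tuple[int, int]]:
    """Return (start, end) millisecond offsets that cover total_ms in order."""
    if chunk_ms <= 0:
        raise ValueError("chunk_ms must be positive")
    return [
        (start, min(start + chunk_ms, total_ms))
        for start in range(0, total_ms, chunk_ms)
    ]


def split_into_chunks(segment: Sequence, chunk_ms: int) -> list:
    """Slice any object that supports len() and slicing into consecutive chunks.

    Assumption: for a pydub AudioSegment, len() is the duration in
    milliseconds and slicing is by milliseconds, as in the cell above.
    """
    return [segment[start:end] for start, end in chunk_ranges(len(segment), chunk_ms)]
```

With pydub, this could be used as `for i, chunk in enumerate(split_into_chunks(song, 10 * 60 * 1000), start=1): chunk.export(f"output/large_audio_{i}.mp3", format="mp3")`, where the ten-minute chunk length is an arbitrary assumption. Fixed-length cuts can fall mid-sentence, which may cost the model some context, so cutting at pauses is preferable where possible.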

prompt="ZyntriQix, Digique Plus, CynapseFive, VortiQore V8, EchoNix Array, OrbitalLink Seven, DigiFractal Matrix, PULSE, RAPT, B.R.I.C.K., Q.U.A.R.T.Z., F.L.I.N.T."

# Baseline: transcribe without a prompt to see how unfamiliar product names are rendered
audio_file = open("data/audio_with_concepts.m4a", "rb")
transcript = client.audio.transcriptions.create(
  model="whisper-1",
  file=audio_file,
  language="en"
)
print(f"Output without prompt:\n{transcript.text}")

Output without prompt:
Welcome to our company Zintrik X. Today, we will talk about our new products, DGQ+, Synapse 5, VortiCore V8, Equinix Array, and also we will talk about our existing products and their performance, which are Brick, Quartz, and Flint.

# Same request, now passing the glossary of product names defined above as the prompt
audio_file = open("data/audio_with_concepts.m4a", "rb")
transcript = client.audio.transcriptions.create(
  model="whisper-1",
  file=audio_file,
  language="en",
  prompt=prompt
)
print(f"Output with prompt:\n{transcript.text}")

Output with prompt:
Welcome to our company ZyntriQix. Today we will talk about our new products, Digique Plus, CynapseFive, VortiQore V8, EchoNix Array, and also we will talk about our existing products and their performance, which are B.R.I.C.K., Q.U.A.R.T.Z., and F.L.I.N.T.
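
One quick way to compare the two outputs above is to check which glossary terms appear verbatim in each transcript. The helper below is a hypothetical sketch (the name `missing_terms` is not from the original notebook). It uses a plain case-sensitive substring match, an assumption that suits exact product spellings but would not catch near misses. The sample strings are shortened excerpts of the two outputs.

```python
from typing import List


def missing_terms(transcript: str, terms: List[str]) -> List[str]:
    """Return the terms that do not appear verbatim (case-sensitive) in transcript."""
    return [term for term in terms if term not in transcript]


# A few of the product names from the prompt that are spoken in the sample audio
spoken_terms = ["ZyntriQix", "Digique Plus", "CynapseFive"]

# Shortened excerpts of the outputs shown above
without_prompt = "Welcome to our company Zintrik X. Today, we will talk about our new products, DGQ+, Synapse 5"
with_prompt = "Welcome to our company ZyntriQix. Today we will talk about our new products, Digique Plus, CynapseFive"

print(missing_terms(without_prompt, spoken_terms))  # all three names are missing
print(missing_terms(with_prompt, spoken_terms))     # nothing is missing
```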

system_prompt = "You are a helpful assistant for the company ZyntriQix. Your task is to correct any spelling discrepancies in the transcribed text. Make sure that the names of the following products are spelled correctly: ZyntriQix, Digique Plus, CynapseFive, VortiQore V8, EchoNix Array, OrbitalLink Seven, DigiFractal Matrix, PULSE, RAPT, B.R.I.C.K., Q.U.A.R.T.Z., F.L.I.N.T. Only add necessary punctuation such as periods, commas, and capitalization, and use only the context provided."

def generate_corrected_transcript(system_prompt, content):
    """Ask the chat model to fix spelling and punctuation in a transcript, guided by system_prompt."""
    response = client.chat.completions.create(
        model="gpt-4o-mini",
        temperature=0,
        messages=[
            {
                "role": "system",
                "content": system_prompt
            },
            {
                "role": "user",
                "content": content
            }
        ]
    )
    return response.choices[0].message.content

audio_file = open("data/audio_with_concepts.m4a", "rb")
transcript = client.audio.transcriptions.create(
  model="whisper-1",
  file=audio_file,
  language="en"
)

corrected_text = generate_corrected_transcript(system_prompt, transcript.text)
print(f"Output corrected by GPT-4:\n{corrected_text}")

Output corrected by GPT-4:
Welcome to our company ZyntriQix. Today, we will talk about our new products, Digique Plus, CynapseFive, VortiQore V8, EchoNix Array, and also we will talk about our existing products and their performance, which are B.R.I.C.K., Q.U.A.R.T.Z., and F.L.I.N.T.

Speech to Text

Introduction

Transcriptions

Translations

Supported languages

Longer inputs

Prompting

Improving reliability

Using the prompt parameter

Post-processing with GPT-4