Enable users to interact with your API
os.getenv("AIHUBMIX_API_KEY").Conversation messages including role and content.
Sampling temperature (0-2). Higher values yield more randomness.
Maximum number of tokens to generate (depends on model).
Top-p nucleus sampling parameter controlling diversity.
Frequency penalty to reduce repetition.
Presence penalty to encourage new topics.
Enable streaming responses for real-time output.
Web search options (only supported by specific search models).