thread-aware conversational ai with streaming responses
Processes @mentions in Slack threads by reading only the conversation thread containing the mention, maintaining context from prior messages in that thread, and streaming responses back to Slack with millisecond-to-second latency. Uses OpenAI GPT (model version unclear, marketed as 'GPT-5.2') or Anthropic Claude 200K depending on token requirements, with automatic model switching when input exceeds 16K tokens. Supports simultaneous multiple requests unlike ChatGPT's sequential 50-per-3-hour rate limit.
Unique: Implements thread-scoped context reading (not workspace-wide) combined with automatic model switching based on token budget, allowing simultaneous parallel requests without per-user rate limiting — a design choice that prioritizes workspace-level throughput over individual user caps
vs alternatives: Faster than ChatGPT for workspace teams because it eliminates context-switching friction and removes per-user rate limits (50/3hr), but narrower than enterprise LLM platforms because it reads only thread context, not full workspace history
multi-source document and url content extraction with format-agnostic parsing
Extracts and analyzes content from diverse sources (web URLs, PDFs, Google Workspace files, YouTube captions, arXiv papers, Notion pages, uploaded files) by sending extracted text/metadata to LLM backend for analysis. Supports public HTTP/HTTPS URLs, direct PDF links, and OAuth-authenticated Google Docs/Sheets/Slides (per-user OAuth, not workspace service account). YouTube extraction includes standard videos, shorts, and live streams via caption parsing. File uploads support PDF, images, Excel, PowerPoint, Word, CSV, plain text, code files, audio, and video (formats unspecified).
Unique: Combines public URL parsing with OAuth-authenticated Google Workspace access and specialized extractors for YouTube captions and arXiv metadata, all within a single Slack command — a breadth-first approach that trades deep integration (e.g., workspace service accounts) for ease of use
vs alternatives: Broader source coverage than ChatGPT (includes YouTube, arXiv, Notion, Google Workspace) but shallower than enterprise document platforms because OAuth is per-user and no workspace-level service account support exists
slack message editing with automatic re-invocation for query refinement
Allows users to edit the original @mention message and automatically re-invoke Q with the modified input, enabling query refinement without re-typing. When a user edits a message that previously invoked Q, the system detects the edit and generates a new response based on the updated message content. This enables iterative refinement of questions within the same thread.
Unique: Implements automatic re-invocation on message edit rather than requiring explicit regenerate button, allowing seamless query refinement by editing the original message — a workflow optimization that reduces friction for iterative questioning
vs alternatives: More intuitive than ChatGPT's regenerate button because it leverages Slack's native edit affordance, but less discoverable because users may not realize editing triggers re-invocation
workspace-level custom instruction templates with persistent context injection
Stores and applies workspace-level instruction templates that are automatically injected into every Q response, allowing teams to define consistent guidelines for email tone, translation rules, content generation style, or coding standards. Templates are defined once in the Q settings panel and applied to all users in the workspace without per-user configuration. Instructions persist across conversations and are re-applied on every invocation.
Unique: Implements workspace-level instruction injection as a persistent configuration rather than per-request overrides, allowing teams to define once and apply globally — a centralized governance approach that differs from per-user or per-conversation customization
vs alternatives: Simpler than fine-tuning custom models because it requires no ML expertise, but less powerful than system prompts in ChatGPT API because it cannot be dynamically modified per-request or per-user
integrated web search with configurable result limits
Augments Q responses with Google Search results by querying the Google Search API and including 3 results (Entry tier), 5 results (Standard tier), or 10 results (Premium tier) in the LLM context before generating responses. Search integration method (API vs. scraping), result ranking, freshness guarantees, and query construction logic are undocumented. Scope of search (web-wide vs. workspace-specific) is unclear.
Unique: Integrates web search as a tier-gated feature with configurable result limits rather than always-on or user-controlled search, allowing Q to supplement LLM knowledge with current web data without requiring user to manage search queries
vs alternatives: Simpler than ChatGPT's web browsing because search is automatic and transparent, but less flexible because users cannot control search parameters or restrict to specific sources
response control and token optimization with regenerate, continue, and delete actions
Provides post-generation response controls including stop (halt streaming mid-response), continue (extend response), regenerate (new response from same input), delete (remove response and save tokens), and edit-to-regenerate (modify original @mention message to re-invoke Q with new input). These controls allow users to optimize token usage and refine responses without re-typing queries. Delete action explicitly saves tokens by removing the response from context.
Unique: Implements response-level controls (stop, continue, regenerate, delete) as first-class Slack UI buttons rather than requiring text commands, combined with explicit token-saving semantics for delete — a UX-first approach that prioritizes discoverability over command-line efficiency
vs alternatives: More granular than ChatGPT's regenerate button because it includes stop, continue, and delete with token awareness, but less powerful than API-level control because users cannot adjust temperature, top-p, or other generation parameters
multi-language support with automatic language detection and translation
Supports input and output in 'almost all languages' (exact language list undocumented) with automatic detection of input language and generation of responses in the same language. Language support is claimed to be comprehensive but no specific language list, character encoding support, or RTL (right-to-left) language handling is documented. Implementation approach (language detection model, translation layer, or native multilingual LLM) is unknown.
Unique: Implements automatic language detection and response generation in the same language as input, without requiring explicit language selection — a zero-configuration approach that assumes users want responses in their input language
vs alternatives: Simpler than ChatGPT's language selection because it requires no user configuration, but less transparent than explicit language choice because detection failures are silent and may produce unexpected language outputs
workspace-level billing and user assignment with per-workspace subscription model
Implements workspace-level billing where a single subscription covers all users in a Slack workspace, with admin controls to assign specific users to different subscription tiers (Entry, Standard, Premium). Billing is managed at the workspace level, not per-user, allowing teams to share a single subscription. Uninstalling the bot immediately cancels all subscriptions with no mid-term refund option. Free 14-day trial available without credit card; can re-trial for 7+ days after expiration by reinstalling.
Unique: Implements workspace-level billing with per-user tier assignment rather than per-user subscriptions, allowing teams to share a single subscription and assign users to different tiers — a cost-sharing model that differs from per-user SaaS pricing
vs alternatives: Cheaper for teams than individual ChatGPT subscriptions because costs are shared, but less flexible than usage-based billing because all users in a tier have identical limits regardless of actual consumption
+3 more capabilities