Sexual Content
Detects explicit sexual content, suggestive flirting, solicitation, minor-related sexual risk, and adult content distribution.
Moderation Coverage
The platform provides standard capabilities, policy templates, and manual policy controls so teams can integrate quickly and then refine rules for their own business logic. It also supports moderation for both Chinese and English text.
Detects explicit sexual content, suggestive flirting, solicitation, minor-related sexual risk, and adult content distribution.
Covers gambling, drugs, fraud, underground trade, privacy abuse, and circumvention-related violations.
Detects sensitive expressions related to public institutions, political figures, regional conflict, historical controversy, and ideology.
Detects personal attacks, discriminatory insults, hate speech, and hostile expressions.
Detects off-platform diversion, contact info drops, ecommerce promotion, financial marketing, and recruitment spam.
Detects violent harm, terrorism, self-harm risk, and weapons or dangerous goods related content.
Bilingual Moderation
The model supports Chinese social content, community comments, customer-service conversations, danmu, news comments, and other text scenarios. It also supports English moderation with a unified result schema, making it easier for cross-region teams to share one policy set and one API.
Covers common Chinese internet expressions, abbreviations, obfuscated variants, and business content scenarios for communities, content platforms, and LLM applications.
Supports English risk-text detection for overseas products, global communities, and English generated-content moderation, reducing multilingual integration cost.
Whether the input is Chinese or English, the API returns one consistent schema of public labels, categories, and scores for client apps, policy engines, and operations systems.
Policy Controls
Supports allow, block, downgrade, review, and alert templates for different moderation goals and business scenarios.
Policies can also be configured manually by top-level category, public label code, threshold, and traffic source.
New users receive 1,000 free credits, and approved company verification unlocks another 50,000 credits for evaluation and onboarding.
Delivery Modes
A cloud API for text moderation, directly callable by API or HTTP SDK, with high concurrency capacity and a 99.9999% service availability target.
Deploy the moderation package on the customer's own servers and run text moderation inside a LAN or private network for stronger data privacy.
Result Schema
Business-facing definitions, handling advice, and representative detection scope. Three labels are shown here; total labels: 7 Contact support for the full catalog.
sexual_solicitation
The text contains prostitution, sexual service solicitation, or adult traffic diversion.
sexual_teasing
The text contains teasing, suggestive, or borderline sexual expressions.
explicit_sexual_content
The text directly describes sexual acts, sexual crimes, or explicit adult scenes.
Business-facing definitions, handling advice, and representative detection scope. Three labels are shown here; total labels: 8 Contact support for the full catalog.
gambling_and_betting
The text involves gambling platforms, betting activities, or abnormal wagering guidance.
drugs_and_contraband
The text involves drugs, prohibited medicine, or other restricted goods.
fraud_and_blackmarket
The text involves fraud, forgery, pyramid schemes, organized crime, or other black/grey market activities.
Business-facing definitions, handling advice, and representative detection scope. Three labels are shown here; total labels: 5 Contact support for the full catalog.
public_affairs_sensitive
The text involves governments, institutions, regulations, or sensitive public governance discussion.
political_figure_sensitive
The text involves political figures, major public figures, or related controversial expressions.
separatism_and_regional_conflict
The text involves sovereignty disputes, separatist rhetoric, or escalated regional conflict expressions.
Business-facing definitions, handling advice, and representative detection scope. Three labels are shown here; total labels: 2 Contact support for the full catalog.
abusive_attack
The text contains obvious personal attacks, verbal abuse, or malicious provocation.
discriminatory_hate
The text contains discriminatory insults toward groups based on region, gender, profession, appearance, or ethnicity.
Business-facing definitions, handling advice, and representative detection scope. Three labels are shown here; total labels: 5 Contact support for the full catalog.
offsite_lead_gen
The text tries to move users off the current platform through add-friend, DM, or diversion wording.
contact_exchange
The text contains WeChat, QQ, phone numbers, URLs, public accounts, or other contact entry points.
commerce_marketing
The text includes product promotion, ecommerce sales, or training-course marketing.
Business-facing definitions, handling advice, and representative detection scope. Three labels are shown here; total labels: 4 Contact support for the full catalog.
violent_harm
The text involves gore, physical harm, murder, or hired violence.
terror_extremism
The text involves terrorist organizations, terrorist incidents, or extremist content.
self_harm
The text involves suicide, self-harm, or expressions of harming oneself.
Business-facing definitions, handling advice, and representative detection scope. Three labels are shown here; total labels: 2 Contact support for the full catalog.
noisy_text
The text is dominated by gibberish, repetition, spammy filler, or low-information content.
other_risk_signal
Non-standard but noteworthy risk signals are detected and should be judged with additional business rules.
Business-facing definitions, handling advice, and representative detection scope. Three labels are shown here; total labels: 3 Contact support for the full catalog.
safe_pass
No clear policy violation is detected in the current text.
safe_context
The text contains neutral discussion, objective statements, or ordinary mentions that do not directly form a risk.
lifestyle_normal
The text is mainly about daily life, health, parenting, or ordinary social topics.
Delivery Value
Results include top-level risk type, public label code, score, and summary buckets so they can be applied directly to policy actions.
The website, user center, and API share the same result schema, making it easy to move from trial to production integration.
The admin console shows registrations, visits, moderation trends, and remaining user quota for ongoing operations.
The main website, product pages, API docs, and user-facing pages support both Chinese and English for international delivery.
Typical Scenarios
Review LLM-generated text before release and identify toxic, illegal, abusive, or politically sensitive content.
Suitable for posts, comments, DMs, and group chats to quickly identify sexual, political, spam, and abusive content.
Detect high-risk text in live comments and video chat streams to support real-time blocking, warnings, or human review.