MQL is highly extensible and can integrate with virtually any tool or service to build better detection rules.

📘
Request a function!
Don't see a function you want? Let us know via email or Slack!

`beta.message_screenshot`

beta.message_screenshot() → File

The beta.message_screenshot function takes a screenshot of the message using the message body's HTML section. This screenshot is the same as the one that shows in the Message Preview pane when viewing a message, and is a representation of what the end-user would see. The resulting file can be passed into other File analysis functions, such as file.explode or ml.logo_detect:

// Check for an embedded Microsoft logo
any(ml.logo_detect(beta.message_screenshot()).brands, 
    .name == "Microsoft" and .confidence in ("medium", "high")
)

// Run OCR on a screenshot of the message
any(file.explode(beta.message_screenshot()),
    strings.ilike(.scan.ocr.raw, "*free cooler*")
)

View detection rules that use this function

beta.scan_base64

Overview

The beta.scan_base64 function identifies base64 encoded strings within text content and decodes them into readable strings. This is particularly useful for detecting phishing attacks where threat actors encode sensitive information like recipient email addresses within URLs or QR codes.

Key Use Cases

Detecting recipient information encoded in phishing URLs
Identifying encoded domains or usernames in message content
Analyzing HTML attachments for encoded data

Technical Specification

Syntax


beta.scan_base64(
    text: string,
    encodings: [“ascii”, “utf8”, “utf16” “latin1”, “windows1252”],
    ignore_padding: bool
) -> []string

Parameters

text (required): The input text to scan for base64 encoded content
encodings (optional): Array of string encodings to attempt when decoding base64 content
- Supported values: “ascii”, “utf8”, “utf16” “latin1”, “windows1252”
- Default: [“ascii”, “utf8”, “latin1”, “windows1252”]
ignore_padding (optional): Boolean flag to control base64 padding handling
- Default: false
- When true: Attempts to decode even with missing padding
- When false: Strictly enforces proper base64 padding

Return Value

Returns an array of successfully decoded strings that were found in the input text. If no base64 string is found returns an empty array of type string.

Usage Guide

HTML attachments with recipient email

type.inbound
// other rule criteria
and any(attachments,
    .file_type == "html"
    and any(beta.scan_base64(file.parse_html(.).raw),
            any(recipients.to, strings.icontains(.., .email.email))
    )
)

PDF attachment(s) with URLs containing the recipient domain


type.inbound
// other rule criteria
and any(attachments,
    .file_type == "pdf"
    and any(file.explode(.),
        any(.scan.url.urls,
            any(beta.scan_base64(.url),
                any(recipients.to,
                    strings.icontains(.., .email.domain.root_domain)
                )
            )
        )
    )
)

URL redirect history or fragment containing encoded recipient email

type.inbound
// other rule criteria
and any(body.links,
    any(ml.link_analysis(.).redirect_history,
        any(beta.scan_base64(.query_params),
            any(recipients.to, strings.icontains(.., .email.email))
        )
        or any(beta.scan_base64(.fragment),
               any(recipients.to, strings.icontains(.., .email.email))
        )
    )
)

Attachment(s) with a QR code with URLs containing the recipient email

type.inbound
// other rule criteria
and any(attachments,
        (.file_type in $file_types_images or .file_extension in $file_extensions_macros or .file_type == "pdf")
        and any(file.explode(.),
                any(recipients.to,
                    any(beta.scan_base64(..scan.qr.url.url, ignore_padding=true),
                        strings.icontains(., ..email.email)
                    )
                )
        )
)

File Analysis

Files can be delivered via email in a variety of ways, including directly as an attachment or auto-downloaded via links.

`file.explode`

file.explode(input: File | HTML) -> [FileExplodeOutput]

FileExplode uses Strelka, a file extraction and metadata collection system developed by Target.

Strelka uses a variety of scanners to parse files of a specific flavor and performs data collection and/or file extraction on them. Strelka can recursively extract nested files (like a Word doc within a Zip file), identify malicious scripts, suspicious executables and text, run analysis like OCR and Macro detection, and more. For more information on how Strelka works, see the official Strelka documentation.

For a list of all available scanners, see the Github repo or the official Strelka docs.

View detection rules that use this function

// detect HTML smuggling techniques
any(attachments, .file_extension in~ ('html', 'htm') and
  any(file.explode(.), "unescape" in .scan.javascript.identifiers) 
)

// detect encrypted zip files
any(attachments,
  any(file.explode(.), 
    'encrypted_zip' in .flavors.yara
  )
)

// detect attachments soliciting the user to enable macros using OCR
any(attachments,
  any(file.explode(.),
    strings.icontains(.scan.ocr.raw, "enable macros")
  )
)

// detect macros with auto-open
any(attachments,
  any(file.explode(.),
    any(.scan.vba.auto_exec, . == "AutoOpen")
  )
)

// detect macros calling an exe
any(attachments,
  any(file.explode(.),
    any(.scan.vba.hex, strings.ilike(., "*exe*"))
  )
)

`file.html_screenshot`

file.html_screenshot(input: File) -> File

The file.html_screenshot function takes a screenshot of HTML files so that you can query the image. This allows you to run logo detect on HTML attachments — ml.logo_detect(file.html_screenshot(.)) — or send the result to file.explode, empowering you to run OCR and QR analysis.

any(attachments,
    (
      .file_extension in~ ("html", "htm", "shtml", "dhtml")
      or .file_type == "html"
      or .content_type == "text/html"
    )
    and any(ml.logo_detect(file.html_screenshot(.)).brands,
            .name != null and .confidence in ("medium", "high")
    )
)

`file.oletools`

file.oletools(input: File) -> OleToolsOutput

Oletools, developed by Philippe Lagadec, analyzes Microsoft OLE2 files such as Microsoft Office documents for malware and other suspicious indicators.

Use file.oletools to analyze attachments for malware or suspicious indicators like VBA macros, remote OLE objects, encryption, and more.

View detection rules that use this function

// detect suspicious macros
any(attachments, file.oletools(.).indicators.vba_macros.exists)
any(attachments, file.oletools(.).indicators.vba_macros.risk == "high")

// detect potential attempts to exploit CVE-2021-40444  (https://msrc.microsoft.com/update-guide/vulnerability/CVE-2021-40444)
any(attachments, any(file.oletools(.).relationships, strings.ilike(.target, "*html:http*")))

// detect external OLE object relationships
any(attachments, file.oletools(.).indicators.external_relationships.count > 0)

// detect encrypted Office documents
any(attachments, file.oletools(.).indicators.encryption.exists)

// detect macros that attempt to auto-execute when the document is opened
any(attachments, any(file.oletools(.).macros.keywords, .type == "autoexec"))

// detect suspicious macro source code
any(attachments, strings.ilike(file.oletools(.).macros.vba_code_all_modules, "*kernel32*", "*GetProcessId*"))

`file.parse_eml`

file.parse_eml(input: Attachment) -> MessageDataModel

The file.parse_eml function takes in an EML attachment (file extension .eml or content type message/rfc822) and parses it into an MDM.

any(attachments,
    (.file_extension == "eml" or .content_type == "message/rfc822")
    and strings.icontains(file.parse_eml(.).subject.subject, "invoice")
)

`file.parse_html`

file.parse_html(input: File) -> HTML

The file.parse_html function parses an HTML file from an attachment, returning the full raw along with display_text, inner_text. This empowers detections such as running NLU on the display_text and completing a regex on the HTML without custom scanners or YARA signatures.

any(attachments,
    (
      .file_extension in~ ("html", "htm", "shtml", "dhtml")
      or .file_type == "html"
    )
    and regex.icontains(file.parse_html(.).raw,
                        "fromCharCode",
                        "charCodeAt",
                        "charAt",
                        "parseInt"
    )
)

`file.parse_text`

file.parse_text(input: File) -> ParseTextOutput

The file.parse_text function parses a file from an attachment, returning the decoded text.

any(attachments,
    (
      .file_extension in~ ("html", "htm", "shtml", "dhtml")
      or .file_type == "html"
    )
    and strings.icontains(file.parse_text(.).text, "invoice")
)

Machine Learning functions

`ml.link_analysis`

ml.link_analysis(input: Link | URL, mode="default") → LinkAnalysisOutput

LinkAnalysis analyzes a link and classifies them as benign or suspicious. The service sends suspicious URLs to a headless browser which resolves the effective URL and collects a screenshot. The screenshot is sent to an object detection model to detect brand logos, buttons, and input forms. We chose Phishpedia, an Open Source object detection project as our baseline model architecture.

If any logos are detected, those logos are cropped from the original screenshot and compared to a set of protected brand logos commonly used in credential phishing attacks. Discovered brands are available to MQL, along with summary information about login input boxes or captchas in the screenshot.

mode is an optional argument that alters LinkAnalysis's analysis criteria (see note below). By changing mode from its default of "default" to "aggressive", LinkAnalysis performs extra processing on a link when determining whether to fully analyze the link. For example, LinkAnalysis with mode="aggressive" will fetch the destination link of known common click trackers via HEAD and apply normal analysis criteria to that destination link.

View rules that use this function

// detect links to credential phishing pages
any(body.links, 
    all([ml.link_analysis(.)],
        .credphish.disposition == "phishing"
         and .credphish.brand.confidence in ("medium", "high")
     )
)

// detect any links to credential phishing pages
any(body.links, 
    any([ml.link_analysis(., mode="aggressive")],
        .credphish.disposition == "phishing"
         and .credphish.brand.confidence in ("medium", "high")
     )
)

// detect free subdomain links with a login or captcha
any(body.links, 
    all([ml.link_analysis(.)], (
          .credphish.contains_login
          or .credphish.contains_captcha
     )
     and (
          .effective_url.domain.root_domain in $free_subdomain_hosts
          or .original_url.domain.root_domain in $free_subdomain_hosts
     ))
)

// analyze the final DOM of a link within the body
any(body.links, 
    strings.icontains(ml.link_analysis(.).final_dom.display_text, "Redirect Notice")
    and strings.contains(ml.link_analysis(.).final_dom.display_text, ".zip")
)

📘
Analysis criteria
In order to prevent LinkAnalysis from "clicking" on every link, such as Unsubscribes and one-time password resets, LinkAnalysis uses a URL classification model to determine which links to actually send to the service for analysis.
You can check whether LinkAnalysis submitted, retrieved, or analyzed the target page by inspecting the response in the MQL editor.
If you observe LinkAnalysis analyzing links it shouldn't or not analyzing links it should, please send us an email or post in the Slack Community.

`ml.logo_detect`

ml.logo_detect(input: File) -> [LogoDetectOutput]

LogoDetect uses computer vision to detect common brand logos used in attachment-based credential phishing attacks, such as impersonations of PayPal, Adobe, Microsoft, Outlook, Office365, DocuSign, and more. This includes embedded images in the body of messages as CIDs.

Our object detection model identifies logos, which are then cropped into separate images. These images are passed through a Siamese Neural Network to generate a feature vector. We compare this vector to a database of known logos using a similarity calculation. If the score exceeds a predetermined threshold, we confirm it as the respective brand logo.

For text-based logos, we utilize OCR, a computer vision technique for extracting text from images. Combined with Siamese Networks, this approach ensures comprehensive logo detection.

View Rules that use this function

// detect SharePoint logos in attached images
any(attachments,
    .file_type in ('png', 'jpeg', 'jpg', 'bmp')
    and any(ml.logo_detect(.).brands, .name == "Microsoft SharePoint")
)

// detect DocuSign logos in attached images
any(attachments,
    .file_type in ('png', 'jpeg', 'jpg', 'bmp')
    and any(ml.logo_detect(.).brands, .name == "DocuSign")
)

// detect Norton logos in attached PDFs
any(attachments,
    .file_type == "pdf"
    and any(ml.logo_detect(.).brands, .name == "Norton")
)

List of Supported Brands

Don't see the brand you're looking for? Want to be able to detect your own company's logo? Contact your account team or send an email to [email protected] with a few examples of the logo and we'll get back to you.

ABN Amro
ADP
AT&T
Adobe
Amazon
American Express
Apple
BB&T Corporation
Bank of America
Barclays
Belastingdienst
Benteler
Bol.com
Box
BT
Capital One Bank
Captcha
Chase
ChicagoTitle
Coinbase
DigiD
DHL
Discover
DocuSign
Dropbox
DPD
Ebay
Facebook
FidelityTitle
FirstAm
GeekSquad
Generic Webmail
GLS
Google
GoogleDrive
Gusto
Heroku
HSBC
HSBC Bank
Hulu
ING
IRS
Instagram
Key Bank
KPN
LawyersTitle
Ledger
LinkedIn
Lloyds
M & T Bank
MadisonTitle
Mastercard
Meta
Microsoft
Microsoft Office365
Microsoft OneDrive
Microsoft Outlook
Microsoft SharePoint
Microsoft Teams
NatWest
Navy Federal Credit Union
Netflix
NHS
Norton
Okta
OldRepublicTitle
OVO
PayPal
Post NL
Quickbooks
Rabobank
Rakuten
RBS
Royal Mail
SBB
Silicon Valley Bank
Slack
Spotify
Square
StewartTitle
SunTrust Bank
Swiss Post
Swisscom
TD Bank
TicorTitle
U.S. Bank
UPS
Venmo
Visa
Vodafone
WeTransfer
Wells Fargo
WhatsApp
Ziggo
Zoom

`ml.macro_classifier`

ml.macro_classifier(input: File) → MLMacrosOutput

The Sublime Macro Classifier introduces machine learning in MQL to detect malicious VBA macro attachments. Combining ML and MQL allows users to combine the model output with custom detection logic to surface what matters most while reducing the noise commonly associated with black-box ML approaches.

The classifier uses XGBoost to analyze VBA keywords, file metadata, and Oletools output to predict whether an attachment is likely to cause harm.

Use ml.macro_classifier to detect suspicious VBA macro attachments.

View rules that use this function

// detect malicious VBA macros in Office documents, high confidence
any(attachments, .file_extension in~ ("doc", "docm", "docx", "dot", "dotm", "pptm", "ppsm", "xlm", "xls", "xlsb", "xlsm", "xlt", "xltm", "zip")
    and ml.macro_classifier(.).malicious
    and ml.macro_classifier(.).confidence in ("high")
)

// detect malicious VBA macros in Office documents, low or medium confidence
any(attachments, .file_extension in~ ("doc", "docm", "docx", "dot", "dotm", "pptm", "ppsm", "xlm", "xls", "xlsb", "xlsm", "xlt", "xltm", "zip")
    and ml.macro_classifier(.).malicious
    and ml.macro_classifier(.).confidence in ("low", "medium")
)

`ml.nlu_classifier`

ml.nlu_classifier(input: str) -> NluResult

Natural Language Understanding, or NLU, provides users with a machine learning service to analyze text-based content. The service has three primary capabilities:

Email Classification
Named Entity Recognition
Topic Recognition

Email Classification

The Email Classification component takes a body of text as input and provides Intents and/or Tags.

Intent
Intents are top-level categories describing common language attackers use to carry out phishing attacks.

Name	Description
`bec`	Emails containing urgent language about quick tasks from C-suite, HR, and Accounting Depts.
`callback_scam`	Emails containing language about renewing/purchasing services such as tech support, antivirus, or cryptocurrency.
`cred_theft`	Emails contain language urging users to visit a link leading to a realistic-looking portal that requires their credentials to log in.
`extortion`	Emails meant to intimidate victims with threats of blackmail.
`steal_pii`	Emails requesting updates to billing information, personal identification, and tax returns.
`job_scam`	Deceptive emails disguised as employment offers to dupe students into divulging sensitive data or becoming unwitting accomplices in criminal or fraudulent schemes.

Tags
Tags are subcategories that provide additional context for financial-themed phishing attacks. The service returns the following values:

Name	Description
`invoice`	These emails contain language about viewing invoices via links or attachments.
`payment`	These emails contain language about ACH, EFT, or Wire payments.
`purchase_order`	These emails contain language about Purchase Orders, Requests for Quotation.

Example Usage

type.inbound
and any([body.plain.raw, body.html.inner_text], 
        any(ml.nlu_classifier(.).intents,
            .name == "bec" and .confidence == "high"
        )
)
// first-time sender
and (
  (
    sender.email.domain.root_domain in $free_email_providers
    and sender.email.email not in $sender_emails
  )
  or (
    sender.email.domain.root_domain not in $free_email_providers
    and sender.email.domain.domain not in $sender_domains
  )
)

Entity Recognition

Named Entity Recognition (NER) identifies, tags, and extracts important keywords within a body of text. Users can leverage this output to determine if an email contains language commonly associated with urgency, requests, or financial matters. The available entities are listed below:

Name	Description	Examples
`greeting`	Token(s) that aid in the identification of the recipient	hello, dear
`financial`	Token(s) containing financial details such as payments, bank accounts, or real estate transactions	wire, bank details, ACH payment
`org`	Token(s) containing an organization name	Google, Microsoft
`recipient`	Token(s) representing the recipient of the email. Either a name or a generic designator.	Jane Doe, all
`request`	Token(s) asking the recipient to act on behalf of the sender	"I need you to", "please open"
`salutation`	Token(s) signifying the end of the correspondence, aids in the identification of the sender	thanks, regards
`sender`	Token(s) representing the sender of an email. Either a name or a generic designator.	Ms. Tyrell, IT Department
`urgency`	Token(s) containing language meant to urge recipient to act immediately	ASAP, immediately

Example Usage

type.inbound
and sender.display_name in~ $org_display_names
and any(ml.nlu_classifier(body.current_thread.text).entities, .name == "urgency")
and any(ml.nlu_classifier(body.current_thread.text).entities, .name == "request")

Topic Recognition

The Topic Classification component takes a body of text as input and provides topic classification for email content analysis. It analyzes message content to identify the primary topics and themes present in the email, helping to categorize and understand message intent.

ml.nlu_classifier().topics(string, display_name=sender.display_name, subject=subject.subject) → TopicResponse

// First parameter is required text to analyze
// Optional named parameters provide additional context

Parameters

Required text input (string) - The text content to analyze
display_name (optional) - Sender display name. Defaults to sender.display_name if not provided
subject (optional) - Subject text. Defaults to subject.subject if not provided

When analyzing message body content, use body.current_thread.text to ensure valid input. The function can also analyze OCR text or other content sources.

Supported Topics

The function can identify the following topics:

Business & Professional

"Financial Communications" - Banking, investments, bills, invoices, financial services
"Legal and Compliance" - Legal matters, terms of service, privacy policies, compliance
"Customer Service and Support" - Support tickets, inquiries, feedback requests
"Professional and Career Development" - Job opportunities, training, industry insights
"E-Signature" - Electronic document signing requests and updates

Technology & Security

"Security and Authentication" - Account security, password resets, 2FA, login alerts
"Software and App Updates" - Software changes, new features, bug fixes
"File Sharing and Cloud Services" - Shared files, storage notifications, collaboration
"Secure Message" - Encrypted messaging and confidential communications

Communications & Notifications

"Newsletters and Digests" - Regular content compilations and updates
"Reminders and Notifications" - Event/task reminders, calendar notifications
"Out of Office and Automatic Replies" - Absence notifications, auto-responses
"Bounce Back and Delivery Failure Notifications" - Failed email delivery notices
"Voicemail Call and Missed Call Notifications" - Alerts for voicemails, calls, and missed call notifications

Marketing & Promotions

"Advertising and Promotions" - Marketing emails, sales, product launches
"Events and Webinars" - Event invitations, RSVPs, online/offline gatherings
"Travel and Transportation" - Trip planning, bookings, travel updates

Public & Community

"Government Services" - Official government communications
"Emergency Alerts" - Urgent notifications, weather, public safety
"News and Current Events" - News updates and current affairs
"Political Mail" - Campaign messages, political updates
"Charity and Non-Profit" - Fundraising, volunteer opportunities
"Environmental and Sustainability" - Updates on environmental initiatives and sustainability efforts

Health & Education

"Health and Wellness" - Medical appointments, health insurance, wellness
"Educational and Research" - Learning materials, academic announcements

Entertainment & Social

"Entertainment and Sports" - Movies, music, games, sports updates
"Social Media and Networking" - Social network notifications, connections

Common Use Cases

Basic Usage

type.inbound
and any(ml.nlu_classifier(body.current_thread.text, display_name=sender.display_name, subject=subject.subject).topics,
    .name == "Voicemail Call and Missed Call Notifications"
    and .confidence == "high"
)

Analyze Attachments with OCR

Combine with OCR functionality to analyze text in attachments:

type.inbound
and any(attachments,
    .file_type in ('pdf', 'png', 'jpeg', 'jpg') and
    any(file.explode(.),
        any(ml.nlu_classifier(.scan.ocr.raw).topics,
            .name == "Financial Communications"
            and .confidence == "high"
        )
    )
 )

Negate Topics

Identify out of office auto replies, and email bound back notification message types:

type.inbound
// your detection logic that you would like to exclude OOO replies from
and not any(ml.nlu_classifier(body.current_thread.text).topics,
    .name in (
        "Out of Office and Automatic Replies",
        "Bounce Back and Delivery Failure Notifications"
    )
    and .confidence == "high"
)

Multi-Topic Analysis

Check for multiple topics to further scope a query:

type.inbound
and any(ml.nlu_classifier(body.current_thread.text).topics,
        .name in ("Political Mail")
        and .confidence == "high"
)
and not any(ml.nlu_classifier(body.current_thread.text).topics,
            .name in (
                "Charity and Non-Profit", 
                "News and Current Events", 
                "Government Services"
            )      
            and (.confidence == "high" or .confidence == "medium")
)

Best Practices

Confidence Levels
- Prefer confidence == "high" when topics are critical to a rule or hunt
- Consider medium confidence for supplementary signals
- Avoid using low confidence results in isolation
Topic Combinations
- Consider both presence and absence of topics
- Use with other detection methods for scoping
Performance
- Avoid unnecessary topic analysis on filtered messages
- Consider using other methods for simple text matching
Input Selection
- Provide specific content for targeted analysis
- Consider context when analyzing extracted text

Considerations

It is important to remember that the NLU engine only looks at text. Because of this, it needs additional context to be an adequate detector. For example, attackers may craft an email that looks the same as a password reset for your favorite social network. The NLU engine would classify the text as cred_theft, but it would also do the same for a legitimate password reset email. But pairing it with a First-Time/Unsolicited Sender or LinkAnalysis provides the necessary context to make an effective detector.

Network Analysis

`network.whois`

network.whois(domain: Domain) -> WhoisOutput

network.whois performs a WHOIS lookup for domain registration on the .root_domain field of a Domain. It returns the domain age, registrar information, and timing information about the age of the registration record and when it was retrieved.

This function can be used to identify newly registered domains, by searching for domain age or if a domain is not found. Lookups are performed against Sublime's WHOIS service, which may be delayed by ~24 hours. Since new domains have a slight delay, searching for .found == false will identify both unregistered and newly registered domains. For some detections, the .found == false could be high enough signal.

View rules that use this function

network.whois(sender.email.domain).found == false or
network.whois(sender.email.domain).days_old <= 7

any(body.links, network.whois(.href_url.domain).days_old <= 14)

HTML Parsing

`html.xpath`

html.xpath -> HTMLXPathResult

html.xpath allows you to query the contents of an HTML document (including the body of an email) using XPath syntax

any(html.xpath(body.html, '//a/@href').nodes,
    strings.parse_url(.raw).domain.root_domain != 'example.com'
)

any(html.xpath(body.html, '//h1').nodes,
    regex.icontains(.display_text, 'Docu.?Sign')
)

any(
    html.xpath(ml.link_analysis(.).final_dom,
        '//a/@href'
    ).nodes,
    strings.parse_url(.raw).domain.tld in $suspicious_tlds
)

Profiling with historical context

📘
Behavior of historical functions
The result of historical functions is always relative to the time of the message that is being evaluated. During live processing, this means the latest possible information is available. However, during a backtest, these functions only take into account messages that are seen prior to that point in time. If there's not enough data, some fields like .prevalence may be "unknown". This behavior ensures that during a backtest there's never access to "future" data, which would lead to incorrect results and a false sense of confidence in the efficacy of a rule.
Results are typically and deliberately delayed by several hours, so that the prevalence of a sender can remain as"new"for approximately 8-12 hours.

`profile.by_sender`

profile.by_sender() -> SenderProfile

profile.by_sender uses previously ingested inbound messages to build a profile for messages received from a matching Sender. This profile captures information like the .prevalence of the sender domain within your environment to assess how common or uncommon it is across messages. It also captures information about flagged messages, such as false positives or true positives.

For the profile.by_sender function, the list $free_email_providers is used to determine whether a sender means a matching email or domain. If the value of sender.email.domain.domain is in $free_email_providers, then sender.email.email is used to determine a matching Sender. Otherwise, all messages with a matching sender.email.domain.domain are considered to be from the same Sender. This ensures that for profile.by_sender, a matching Sender covers messages from an organization, instead of an individual.

Using profile.by_sender() to find a first-time sender:

type.inbound
and profile.by_sender().prevalence == "new"

Using lists do find a first-time sender is the same but more verbose:

type.inbound
and (
  (
    sender.email.domain.root_domain in $free_email_providers
    and sender.email.email not in $sender_emails
  )
  or (
    sender.email.domain.root_domain not in $free_email_providers
    and sender.email.domain.domain not in $sender_domains
  )
)

To check against the historical reputation for a sender, check whether a sender has sent at least 1 message flagged as malicious or spam but no confirmed false positives.


type.inbound
and profile.by_sender().any_messages_malicious_or_spam
and not profile.by_sender().any_messages_benign

// Additional logic on the suspicious sender.
and ...

Two more sender profile functions: profile.by_sender_domain and profile.by_sender_email exist if the automatic switching between email and domain is not preferred.

`profile.by_sender_domain`

profile.by_sender_domain() -> SenderProfile

profile.by_sender_domain uses previously ingested inbound messages to build a profile for messages received from a matching sender.email.domain.domain.

type.inbound

// filter by first-seen domains or anomalous domains in your environment
and profile.by_sender_domain().prevalence in ("outlier", "new")

// scrutinize PDF attachments, for example
and any(attachments, .file_extension == "pdf" and ...)

`profile.by_sender_email`

profile.by_sender_email() -> SenderProfile

profile.by_sender_domain uses previously ingested inbound messages to build a profile for messages received from a matching sender.email.email.

type.inbound

// filter by first-seen or anomalous email addresses in your environment
and profile.by_sender_email().prevalence in ("outlier", "new")

Together, profile.by_sender_domain and profile.by_sender_email can be used to tell when a domain is common but the sending email address is new:

type.inbound

// not a free email provider
and sender.email.domain.domain not in $free_email_providers

// domain is common in your environment
and profile.by_sender_domain().prevalence == "common"

// but this is the first time you've received messages from this sender
and profile.by_sender_email().prevalence == "new"

📘Request a function!

beta.message_screenshot

beta.scan_base64

Overview

Key Use Cases

Technical Specification

Syntax

Parameters

Return Value

Usage Guide

HTML attachments with recipient email

PDF attachment(s) with URLs containing the recipient domain

URL redirect history or fragment containing encoded recipient email

Attachment(s) with a QR code with URLs containing the recipient email

File Analysis

file.explode

file.html_screenshot

file.oletools

file.parse_eml

file.parse_html

file.parse_text

Machine Learning functions

ml.link_analysis

📘Analysis criteria

ml.logo_detect

List of Supported Brands

ml.macro_classifier

ml.nlu_classifier

Email Classification

Example Usage

Entity Recognition

Example Usage

Topic Recognition

Parameters

Supported Topics

Business & Professional

Technology & Security

Communications & Notifications

Marketing & Promotions

Public & Community

Health & Education

Entertainment & Social

Common Use Cases

Basic Usage

Analyze Attachments with OCR

Negate Topics

Multi-Topic Analysis

Best Practices

Considerations

Network Analysis

network.whois

HTML Parsing

html.xpath

Profiling with historical context

📘Behavior of historical functions

profile.by_sender

profile.by_sender_domain

profile.by_sender_email

📘
Request a function!

`beta.message_screenshot`

`file.explode`

`file.html_screenshot`

`file.oletools`

`file.parse_eml`

`file.parse_html`

`file.parse_text`

`ml.link_analysis`

📘
Analysis criteria

`ml.logo_detect`

`ml.macro_classifier`

`ml.nlu_classifier`

`network.whois`

`html.xpath`

📘
Behavior of historical functions

`profile.by_sender`

`profile.by_sender_domain`

`profile.by_sender_email`