Documentation
¶
Index ¶
Constants ¶
This section is empty.
Variables ¶
This section is empty.
Functions ¶
This section is empty.
Types ¶
type Error ¶
type Error struct { Err *errors.Error `json:"error"` Service string `json:"service"` // Currently running function, for example "scrape" Meta Meta `json:"meta"` }
Error represents an error that occurred during the processing of a Krang Lambda function.
type Meta ¶
type Meta struct { GroupSlug string `json:"group_slug"` TaskID int64 `json:"task_id"` ScrapeID string `json:"scrape_id"` URL string `json:"url"` SearchTerm string `json:"search_term"` Data map[string]any `json:"data"` }
Meta represents the attributes of a failed task.
type Page ¶ added in v0.0.5
type Page struct { ID primitive.ObjectID `bson:"_id,omitempty" json:"id"` ScrapeID *primitive.ObjectID `bson:"scrape_id" json:"scrape_id"` URL string `bson:"url" json:"url"` GroupSlug string `bson:"group_slug,omitempty" json:"group_slug"` TaskID int64 `bson:"task_id,omitempty" json:"task_id"` SearchTerm string `json:"search_term" bson:"search_term"` RelevancyScore uint `json:"relevancy_score" bson:"relevancy_score"` SiteScore uint `json:"site_score" bson:"site_score"` Scrape Scrape `bson:"scrape" json:"scrape"` UpdatedAt time.Time `bson:"updated_at" json:"updated_at"` CreatedAt time.Time `bson:"created_at" json:"created_at"` }
Page represents an individual task scrape including metadata from the Task.
type Scrape ¶
type Scrape struct { ID primitive.ObjectID `json:"id" bson:"_id,omitempty"` URL string `json:"url" bson:"url"` HTTPStatus int `json:"http_status" bson:"http_status,omitempty"` Content ScrapeContent `json:"content" bson:"content,omitempty"` Metrics ScrapeMetrics `json:"metrics" bson:"metrics,omitempty"` }
Scrape represents an individual scrape of a page and its various metrics.
type ScrapeContent ¶
type ScrapeContent struct { H1 string `json:"h1" bson:"h1"` H2 string `json:"h2" bson:"h2"` Title string `json:"title" bson:"title"` ExternalLinks int `json:"external_links" bson:"external_links"` Body string `json:"body" bson:"body"` }
ScrapeContent represents the HTML markup of a page including any <body> content that's relevant for scoring.
type ScrapeMetrics ¶
type ScrapeMetrics struct { Backlinks int `json:"backlinks" bson:"backlinks"` LoadingTime time.Duration `json:"loading_time" bson:"loading_time"` AhrefsDA int `json:"ahrefs_da" bson:"ahrefs_da"` // Domain Authority MozPA int `json:"moz_pa" bson:"moz_pa"` // Page Authority MozDA int `json:"moz_da" bson:"moz_da"` // Domain Authority MozSpamScore int `json:"moz_spam_score" bson:"moz_spam_score"` MajesticCF int `json:"majestic_cf" bson:"majestic_cf"` // Citation Flow MajesticTF int `json:"majestic_tf" bson:"majestic_tf"` // Trust Flow }
ScrapeMetrics represents the scores and metrics retrieved from Ahrefs, Moz and Majestic.
Click to show internal directories.
Click to hide internal directories.