Page MenuHomePhabricator

mfossati (Marco Fossati)
Software Engineer, Structured Content

Today

  • Clear sailing ahead.

Tomorrow

  • Clear sailing ahead.

Wednesday

  • Clear sailing ahead.

User Details

User Since
Jan 6 2022, 7:27 PM (132 w, 3 d)
Availability
Available
LDAP User
Marco Fossati
MediaWiki User
MFossati (WMF) [ Global Accounts ]

Recent Activity

Thu, Jul 18

mfossati moved T368624: [XL] Post-upload job to detect logos from Blocked to Doing on the Structured-Data-Backlog (Current Work) board.

Unblocked: directly load the logo detection model within the same code.

Thu, Jul 18, 1:12 PM · Structured-Data-Backlog (Current Work)

Wed, Jul 17

mfossati updated the task description for T370137: Set up a bot account on Commons to add logo-detection statements .
Wed, Jul 17, 10:49 AM · Structured-Data-Backlog (Current Work)
mfossati changed the status of T370137: Set up a bot account on Commons to add logo-detection statements from Open to In Progress.
Wed, Jul 17, 10:44 AM · Structured-Data-Backlog (Current Work)
mfossati changed the status of T370137: Set up a bot account on Commons to add logo-detection statements , a subtask of T349641: [EPIC] MVP Logo machine detection on Commons, from Open to In Progress.
Wed, Jul 17, 10:44 AM · OKR-Work, UploadWizard, Epic, Structured-Data-Backlog (Current Work)

Tue, Jul 16

mfossati moved T370137: Set up a bot account on Commons to add logo-detection statements from Incoming to Needs Community Liaison Input on the Structured-Data-Backlog (Current Work) board.
Tue, Jul 16, 10:08 AM · Structured-Data-Backlog (Current Work)
mfossati created T370137: Set up a bot account on Commons to add logo-detection statements .
Tue, Jul 16, 10:07 AM · Structured-Data-Backlog (Current Work)
mfossati moved T368624: [XL] Post-upload job to detect logos from Doing to Blocked on the Structured-Data-Backlog (Current Work) board.

Blocked by T364551#9977031.

Tue, Jul 16, 9:58 AM · Structured-Data-Backlog (Current Work)

Mon, Jul 15

mfossati updated subscribers of T364551: [SPIKE] Send an image thumbnail to the logo detection service within Upload Wizard.

Thanks @kevinbazira for the prompt action. @klausman @isarantopoulos , as we're now entering hypothesis work under time constraints, could you please give us an estimate to tackle this request?
The LiftWing endpoint being accessible is a hard requirement for T368624: [XL] Post-upload job to detect logos. CC @AUgolnikova-WMF .

Mon, Jul 15, 4:32 PM · MW-1.43-notes (1.43.0-wmf.12; 2024-07-02), Structured-Data-Backlog (Current Work), Machine-Learning-Team
mfossati updated the task description for T364551: [SPIKE] Send an image thumbnail to the logo detection service within Upload Wizard.
Mon, Jul 15, 3:49 PM · MW-1.43-notes (1.43.0-wmf.12; 2024-07-02), Structured-Data-Backlog (Current Work), Machine-Learning-Team
mfossati changed the status of T364551: [SPIKE] Send an image thumbnail to the logo detection service within Upload Wizard from Open to In Progress.
Mon, Jul 15, 3:47 PM · MW-1.43-notes (1.43.0-wmf.12; 2024-07-02), Structured-Data-Backlog (Current Work), Machine-Learning-Team
mfossati changed the status of T364551: [SPIKE] Send an image thumbnail to the logo detection service within Upload Wizard, a subtask of T349641: [EPIC] MVP Logo machine detection on Commons, from Open to In Progress.
Mon, Jul 15, 3:45 PM · OKR-Work, UploadWizard, Epic, Structured-Data-Backlog (Current Work)

Fri, Jul 12

mfossati updated subscribers of T364551: [SPIKE] Send an image thumbnail to the logo detection service within Upload Wizard.

@matthiasmullie wrote:

Hi @kevinbazira; we finally have an API in production that is supposed to send data to the logo detection service at https://inference-staging.svc.codfw.wmnet:30443/v1/models/logo-detection:predict
It doesn’t fully seem to work, though - it looks like some servers are able to access that uri, while others are not.
E.g.:

curl -H 'X-Wikimedia-Debug: backend=mwdebug1001.eqiad.wmnet' https://commons.wikimedia.org/w/api.php\?action\=mediadetection\&format\=json\&formatversion\=2\&filekey\=1b2a8gxjr6m0.xox7qj.6750701.jpg
{"predictions":[{"filename":"1b2a8gxjr6m0.xox7qj.6750701.jpg","target":"logo","prediction":0.0035,"out_of_domain":0.9965}]}

but:

curl -H 'X-Wikimedia-Debug: backend=k8s-mwdebug' https://commons.wikimedia.org/w/api.php\?action\=mediadetection\&format\=json\&formatversion\=2\&filekey\=1b2a8gxjr6m0.xox7qj.6750701.jpg
{"error":{"code":"http-timed-out","info":"HTTP request timed out.","docref":"See https://commons.wikimedia.org/w/api.php for API usage. Subscribe to the mediawiki-api-announce mailing list at <https://lists.wikimedia.org/postorius/lists/mediawiki-api-announce.lists.wikimedia.org/> for notice of API deprecations and breaking changes."},"servedby":"mw-debug.eqiad.pinkunicorn-b74f6b749-d6pc6"}

We suspect that that internal endpoint is not available to mw k8s nodes - do you know how to make them accessible?

Fri, Jul 12, 3:44 PM · MW-1.43-notes (1.43.0-wmf.12; 2024-07-02), Structured-Data-Backlog (Current Work), Machine-Learning-Team

Thu, Jul 11

mfossati created T369830: [SPIKE] Investigate how Web users discover Commons.
Thu, Jul 11, 3:03 PM · Structured-Data-Backlog
mfossati added a comment to T339129: [L] Periodically regenerate various variable data sets/files.

Also done with https://gitlab.wikimedia.org/repos/structured-data/image-suggestions/-/merge_requests/39 and https://gitlab.wikimedia.org/repos/data-engineering/airflow-dags/-/merge_requests/583.
@matthiasmullie the floor is back to you.

Thu, Jul 11, 11:20 AM · Patch-For-Review, Structured-Data-Backlog (Current Work), Section-Topics, Section-Level-Image-Suggestions, Image-Suggestions
mfossati closed T368167: [M] Extend logo detection metrics to tools besides Upload Wizard, a subtask of T349641: [EPIC] MVP Logo machine detection on Commons, as Resolved.
Thu, Jul 11, 10:30 AM · OKR-Work, UploadWizard, Epic, Structured-Data-Backlog (Current Work)
mfossati closed T368167: [M] Extend logo detection metrics to tools besides Upload Wizard as Resolved.
Thu, Jul 11, 10:30 AM · Structured-Data-Backlog (Current Work)

Wed, Jul 10

mfossati changed the status of T368624: [XL] Post-upload job to detect logos from Open to In Progress.
Wed, Jul 10, 4:11 PM · Structured-Data-Backlog (Current Work)
mfossati changed the status of T368624: [XL] Post-upload job to detect logos, a subtask of T349641: [EPIC] MVP Logo machine detection on Commons, from Open to In Progress.
Wed, Jul 10, 4:11 PM · OKR-Work, UploadWizard, Epic, Structured-Data-Backlog (Current Work)
mfossati moved T368624: [XL] Post-upload job to detect logos from Ready for Development to Doing on the Structured-Data-Backlog (Current Work) board.
Wed, Jul 10, 4:11 PM · Structured-Data-Backlog (Current Work)
mfossati moved T362328: [L] Improve the date field in describe step of upload wizard from Doing to Verify on Production on the Structured-Data-Backlog (Current Work) board.
Wed, Jul 10, 4:10 PM · MW-1.43-notes (1.43.0-wmf.13; 2024-07-09), UploadWizard, Structured-Data-Backlog (Current Work)
mfossati updated the task description for T362328: [L] Improve the date field in describe step of upload wizard.
Wed, Jul 10, 4:10 PM · MW-1.43-notes (1.43.0-wmf.13; 2024-07-09), UploadWizard, Structured-Data-Backlog (Current Work)

Tue, Jul 9

mfossati moved T369053: Changes warnings for "depicts" and "date" in UploadWizard to notices from Code Review to Needs QA on the Structured-Data-Backlog (Current Work) board.
Tue, Jul 9, 11:43 AM · MW-1.43-notes (1.43.0-wmf.13; 2024-07-09), Structured-Data-Backlog (Current Work), UploadWizard
mfossati added a comment to T364374: [L] Prepare image suggestions for a new set of Wikipedias.
  • change section topics DAG's default data quality scripts output

A note that we should tackle this.

Tue, Jul 9, 9:32 AM · Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Level-Image-Suggestions
mfossati updated the task description for T364374: [L] Prepare image suggestions for a new set of Wikipedias.
Tue, Jul 9, 9:31 AM · Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Level-Image-Suggestions

Mon, Jul 8

mfossati added a comment to T362328: [L] Improve the date field in describe step of upload wizard.

Actually partially addressed, see T369053#9962162.

Mon, Jul 8, 5:56 PM · MW-1.43-notes (1.43.0-wmf.13; 2024-07-09), UploadWizard, Structured-Data-Backlog (Current Work)
mfossati added a comment to T369053: Changes warnings for "depicts" and "date" in UploadWizard to notices.

The good candidate (in my opinion) for such change would be a message that should appear for a pre-filled date from EXIF . (per the not-yet-done spec in T362328):

BTW @matthiasmullie I've just downloaded the patch, and this message doesn't show up anymore.

Mon, Jul 8, 5:27 PM · MW-1.43-notes (1.43.0-wmf.13; 2024-07-09), Structured-Data-Backlog (Current Work), UploadWizard
mfossati updated subscribers of T362328: [L] Improve the date field in describe step of upload wizard.

@mfossati I am moving this back to doing as we have one AC missing

@Sneha I think this will be addressed by https://gerrit.wikimedia.org/r/c/mediawiki/extensions/UploadWizard/+/1051758, is that right @matthiasmullie ?

Mon, Jul 8, 4:51 PM · MW-1.43-notes (1.43.0-wmf.13; 2024-07-09), UploadWizard, Structured-Data-Backlog (Current Work)
mfossati added a comment to T368167: [M] Extend logo detection metrics to tools besides Upload Wizard.

Pasting the conclusion here for convenience.

Mon, Jul 8, 1:55 PM · Structured-Data-Backlog (Current Work)
mfossati moved T368167: [M] Extend logo detection metrics to tools besides Upload Wizard from Doing to Code Review on the Structured-Data-Backlog (Current Work) board.
Mon, Jul 8, 1:54 PM · Structured-Data-Backlog (Current Work)
mfossati updated the task description for T368167: [M] Extend logo detection metrics to tools besides Upload Wizard.
Mon, Jul 8, 1:52 PM · Structured-Data-Backlog (Current Work)
mfossati added a comment to T368167: [M] Extend logo detection metrics to tools besides Upload Wizard.

Other edit tags

PAWS

No deletions due to logo except a negligible 0.002 % on Feb 2024.

Mon, Jul 8, 8:24 AM · Structured-Data-Backlog (Current Work)

Fri, Jul 5

mfossati updated the task description for T368167: [M] Extend logo detection metrics to tools besides Upload Wizard.
Fri, Jul 5, 3:48 PM · Structured-Data-Backlog (Current Work)
mfossati added a comment to T368167: [M] Extend logo detection metrics to tools besides Upload Wizard.

Results

Upload Wizard

yearmonthtotal_uploadswith_logo_drdeleted_without_drtotal
202406259
2024053936700.0132090329463764070.0378489597886554760.05105799273503188
2024043994370.036551446160470860.043811664918372610.08036311107884347
2024034067770.082354705403697840.0621962402004046440.1445509456041025
2024023532490.051521731130165970.080679633912622530.1322013650427885
2024013524670.123983238147117310.068942624415902760.19292586256302008
2023123107980.059845944954600740.078829336096113870.1386752810507146
2023113375880.0521345545457776930.053615649845373650.10575020439115135
Fri, Jul 5, 3:48 PM · Structured-Data-Backlog (Current Work)
mfossati updated the task description for T368167: [M] Extend logo detection metrics to tools besides Upload Wizard.
Fri, Jul 5, 3:05 PM · Structured-Data-Backlog (Current Work)
mfossati updated the task description for T368167: [M] Extend logo detection metrics to tools besides Upload Wizard.
Fri, Jul 5, 3:03 PM · Structured-Data-Backlog (Current Work)

Wed, Jul 3

mfossati added a comment to T368624: [XL] Post-upload job to detect logos.
  • have a script running continuously somewhere (on toolforge?) listening to MW events

References:

Wed, Jul 3, 10:13 AM · Structured-Data-Backlog (Current Work)
mfossati awarded T368685: [ 1.43.0-wmf.11 ] TypeError: Cannot convert undefined or null to object in createReferenceWidget a Party Time token.
Wed, Jul 3, 8:17 AM · Wikimedia-production-error, Structured-Data-Backlog, WikibaseMediaInfo

Tue, Jul 2

mfossati moved T362328: [L] Improve the date field in describe step of upload wizard from Code Review to Needs QA on the Structured-Data-Backlog (Current Work) board.
Tue, Jul 2, 1:54 PM · MW-1.43-notes (1.43.0-wmf.13; 2024-07-09), UploadWizard, Structured-Data-Backlog (Current Work)
mfossati updated the task description for T368167: [M] Extend logo detection metrics to tools besides Upload Wizard.
Tue, Jul 2, 8:20 AM · Structured-Data-Backlog (Current Work)

Mon, Jul 1

mfossati changed the status of T368736: Structured Data add reference not working from Open to In Progress.
Mon, Jul 1, 4:35 PM · MW-1.43-notes (1.43.0-wmf.11; 2024-06-25), Structured-Data-Backlog (Current Work), Wikidata, SDC General
mfossati moved T368736: Structured Data add reference not working from Incoming to Code Review on the Structured-Data-Backlog (Current Work) board.
Mon, Jul 1, 4:35 PM · MW-1.43-notes (1.43.0-wmf.11; 2024-06-25), Structured-Data-Backlog (Current Work), Wikidata, SDC General
mfossati added a subtask for T340437: [EPIC] Image suggestions data pipelines maintenance : T368931: Unify CI jobs.
Mon, Jul 1, 3:29 PM · Epic, Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Topics, Essential-Work, Section-Level-Image-Suggestions
mfossati added a parent task for T368931: Unify CI jobs: T340437: [EPIC] Image suggestions data pipelines maintenance .
Mon, Jul 1, 3:29 PM · Structured-Data-Backlog, Section-Topics, Section-Level-Image-Suggestions, Image-Suggestions
mfossati created T368931: Unify CI jobs.
Mon, Jul 1, 3:29 PM · Structured-Data-Backlog, Section-Topics, Section-Level-Image-Suggestions, Image-Suggestions
mfossati added a comment to T368167: [M] Extend logo detection metrics to tools besides Upload Wizard.
NOTE: From a category query, the Android app has 418,151 images uploaded.
Mon, Jul 1, 1:56 PM · Structured-Data-Backlog (Current Work)
mfossati renamed T368167: [M] Extend logo detection metrics to tools besides Upload Wizard from Extend logo detection metrics to tools besides Upload Wizard to [M] Extend logo detection metrics to tools besides Upload Wizard.
Mon, Jul 1, 8:32 AM · Structured-Data-Backlog (Current Work)
mfossati changed the status of T368167: [M] Extend logo detection metrics to tools besides Upload Wizard from Open to In Progress.
Mon, Jul 1, 8:31 AM · Structured-Data-Backlog (Current Work)
mfossati changed the status of T368167: [M] Extend logo detection metrics to tools besides Upload Wizard, a subtask of T349641: [EPIC] MVP Logo machine detection on Commons, from Open to In Progress.
Mon, Jul 1, 8:31 AM · OKR-Work, UploadWizard, Epic, Structured-Data-Backlog (Current Work)

Wed, Jun 26

mfossati moved T366323: Error messages for Title field in upload Wizard from Code Review to Needs QA on the Structured-Data-Backlog (Current Work) board.
Wed, Jun 26, 4:50 PM · MW-1.43-notes (1.43.0-wmf.12; 2024-07-02), UploadWizard, Structured-Data-Backlog (Current Work)
mfossati moved T365406: [common-wmf.5] UploadWizard - to see "Caption/Description is required" warning scrolling up is needed from Code Review to Needs QA on the Structured-Data-Backlog (Current Work) board.
Wed, Jun 26, 4:49 PM · MW-1.43-notes (1.43.0-wmf.12; 2024-07-02), Patch-For-Review, Structured-Data-Backlog (Current Work), UploadWizard
mfossati added a comment to T339129: [L] Periodically regenerate various variable data sets/files.

https://gitlab.wikimedia.org/repos/structured-data/section-topics/-/merge_requests/29 reviewed, looks great to me.

Wed, Jun 26, 9:45 AM · Patch-For-Review, Structured-Data-Backlog (Current Work), Section-Topics, Section-Level-Image-Suggestions, Image-Suggestions

Mon, Jun 24

mfossati awarded T286852: In the "Describe" tab, UW prompts for "Image title" when uploading a PDF. a Fox token.
Mon, Jun 24, 8:29 AM · UploadWizard

Jun 21 2024

mfossati added a subtask for T349641: [EPIC] MVP Logo machine detection on Commons: T368167: [M] Extend logo detection metrics to tools besides Upload Wizard.
Jun 21 2024, 5:56 PM · OKR-Work, UploadWizard, Epic, Structured-Data-Backlog (Current Work)
mfossati added a parent task for T368167: [M] Extend logo detection metrics to tools besides Upload Wizard: T349641: [EPIC] MVP Logo machine detection on Commons.
Jun 21 2024, 5:56 PM · Structured-Data-Backlog (Current Work)
mfossati created T368167: [M] Extend logo detection metrics to tools besides Upload Wizard.
Jun 21 2024, 5:54 PM · Structured-Data-Backlog (Current Work)
mfossati moved T362328: [L] Improve the date field in describe step of upload wizard from Doing to Code Review on the Structured-Data-Backlog (Current Work) board.
Jun 21 2024, 4:52 PM · MW-1.43-notes (1.43.0-wmf.13; 2024-07-09), UploadWizard, Structured-Data-Backlog (Current Work)

Jun 19 2024

mfossati moved T360515: UploadWizard doesn't remember any more "release rights" step decisions from Code Review to Needs QA on the Structured-Data-Backlog (Current Work) board.
Jun 19 2024, 1:43 PM · MW-1.43-notes (1.43.0-wmf.11; 2024-06-25), Regression, Structured-Data-Backlog (Current Work), UploadWizard

Jun 6 2024

mfossati updated the task description for T364374: [L] Prepare image suggestions for a new set of Wikipedias.
Jun 6 2024, 10:52 AM · Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Level-Image-Suggestions

Jun 4 2024

mfossati moved T361045: [L] Improve the "use" step in the upload wizard from Code Review to Needs QA on the Structured-Data-Backlog (Current Work) board.
Jun 4 2024, 5:38 PM · MW-1.43-notes (1.43.0-wmf.9; 2024-06-11), Structured-Data-Backlog (Current Work), UploadWizard

Jun 3 2024

mfossati updated the task description for T364374: [L] Prepare image suggestions for a new set of Wikipedias.
Jun 3 2024, 11:03 AM · Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Level-Image-Suggestions
mfossati closed T350007: [M] Adapt image suggestions to comply with breaking database schema changes as Resolved.
isu = spark.read.table('analytics_platform_eng.image_suggestions_suggestions')
alis = isu.where('section_index is null')
slis = isu.where('section_index is not null')
Jun 3 2024, 10:57 AM · Structured-Data-Backlog (Current Work), Image-Suggestions
mfossati closed T350007: [M] Adapt image suggestions to comply with breaking database schema changes, a subtask of T340437: [EPIC] Image suggestions data pipelines maintenance , as Resolved.
Jun 3 2024, 10:57 AM · Epic, Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Topics, Essential-Work, Section-Level-Image-Suggestions
mfossati reopened T364374: [L] Prepare image suggestions for a new set of Wikipedias, a subtask of T340437: [EPIC] Image suggestions data pipelines maintenance , as In Progress.
Jun 3 2024, 10:56 AM · Epic, Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Topics, Essential-Work, Section-Level-Image-Suggestions
mfossati reopened T364374: [L] Prepare image suggestions for a new set of Wikipedias as "In Progress".
Jun 3 2024, 10:56 AM · Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Level-Image-Suggestions
mfossati closed T364374: [L] Prepare image suggestions for a new set of Wikipedias as Resolved.
Jun 3 2024, 10:55 AM · Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Level-Image-Suggestions
mfossati closed T364374: [L] Prepare image suggestions for a new set of Wikipedias, a subtask of T340437: [EPIC] Image suggestions data pipelines maintenance , as Resolved.
Jun 3 2024, 10:55 AM · Epic, Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Topics, Essential-Work, Section-Level-Image-Suggestions
mfossati updated the task description for T364374: [L] Prepare image suggestions for a new set of Wikipedias.
Jun 3 2024, 10:15 AM · Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Level-Image-Suggestions

May 31 2024

mfossati added a comment to T361045: [L] Improve the "use" step in the upload wizard.

@Etonkovidova @Sneha FYI as of now the patch is reverted, so we won't see the change on beta until we re-merge it.

May 31 2024, 2:12 PM · MW-1.43-notes (1.43.0-wmf.9; 2024-06-11), Structured-Data-Backlog (Current Work), UploadWizard
mfossati moved T366266: Make captions optional when inputting descriptions from Code Review to Needs QA on the Structured-Data-Backlog (Current Work) board.
May 31 2024, 12:52 PM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Structured-Data-Backlog (Current Work)
mfossati changed the status of T364374: [L] Prepare image suggestions for a new set of Wikipedias from Open to In Progress.
May 31 2024, 11:07 AM · Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Level-Image-Suggestions
mfossati changed the status of T364374: [L] Prepare image suggestions for a new set of Wikipedias, a subtask of T340437: [EPIC] Image suggestions data pipelines maintenance , from Open to In Progress.
May 31 2024, 11:07 AM · Epic, Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Topics, Essential-Work, Section-Level-Image-Suggestions
mfossati added a comment to T361061: [M] Update the 'other information' field in upload wizard.

@Etonkovidova @Sneha , the reason why I haven't added that horizontal line is because another one will show up in case of multiple uploads, so I've left it out.

May 31 2024, 10:22 AM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Structured-Data-Backlog (Current Work), UploadWizard

May 30 2024

mfossati changed the status of T362328: [L] Improve the date field in describe step of upload wizard from Open to In Progress.
May 30 2024, 10:10 AM · MW-1.43-notes (1.43.0-wmf.13; 2024-07-09), UploadWizard, Structured-Data-Backlog (Current Work)
mfossati changed the status of T362328: [L] Improve the date field in describe step of upload wizard, a subtask of T358765: [EPIC] Describe step UX improvements in the UW on Commons, from Open to In Progress.
May 30 2024, 10:10 AM · Epic, UploadWizard, Structured-Data-Backlog (Current Work)
mfossati moved T361061: [M] Update the 'other information' field in upload wizard from Code Review to Needs QA on the Structured-Data-Backlog (Current Work) board.
May 30 2024, 10:07 AM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Structured-Data-Backlog (Current Work), UploadWizard

May 29 2024

mfossati moved T361045: [L] Improve the "use" step in the upload wizard from Code Review to Needs QA on the Structured-Data-Backlog (Current Work) board.
May 29 2024, 5:58 PM · MW-1.43-notes (1.43.0-wmf.9; 2024-06-11), Structured-Data-Backlog (Current Work), UploadWizard
KStoller-WMF awarded T364374: [L] Prepare image suggestions for a new set of Wikipedias a Like token.
May 29 2024, 1:23 PM · Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Level-Image-Suggestions
mfossati added a comment to T364374: [L] Prepare image suggestions for a new set of Wikipedias.

Hey @KStoller-WMF , chiming in while @AUgolnikova-WMF is out of office: yes, I'll pick up this ticket next week. Stay tuned!

May 29 2024, 9:32 AM · Structured-Data-Backlog (Current Work), Image-Suggestions, Section-Level-Image-Suggestions

May 28 2024

mfossati updated the task description for T363707: UploadWizard homeButton mal formatted link.
May 28 2024, 3:02 PM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Structured-Data-Backlog (Current Work), UploadWizard

May 27 2024

mfossati moved T361061: [M] Update the 'other information' field in upload wizard from Doing to Code Review on the Structured-Data-Backlog (Current Work) board.
May 27 2024, 2:49 PM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Structured-Data-Backlog (Current Work), UploadWizard

May 22 2024

mfossati changed the status of T361061: [M] Update the 'other information' field in upload wizard from Open to In Progress.
May 22 2024, 3:24 PM · MW-1.43-notes (1.43.0-wmf.8; 2024-06-04), Structured-Data-Backlog (Current Work), UploadWizard
mfossati changed the status of T361061: [M] Update the 'other information' field in upload wizard, a subtask of T358765: [EPIC] Describe step UX improvements in the UW on Commons, from Open to In Progress.
May 22 2024, 3:24 PM · Epic, UploadWizard, Structured-Data-Backlog (Current Work)

May 15 2024

mfossati moved T361050: [XL] Improve how categories field is displayed in the upload wizard from Doing to Code Review on the Structured-Data-Backlog (Current Work) board.
May 15 2024, 2:33 PM · MW-1.43-notes (1.43.0-wmf.6; 2024-05-21), Structured-Data-Backlog (Current Work), UploadWizard
mfossati updated mfossati.
May 15 2024, 2:15 PM
mfossati updated the task description for T364551: [SPIKE] Send an image thumbnail to the logo detection service within Upload Wizard.
May 15 2024, 9:20 AM · MW-1.43-notes (1.43.0-wmf.12; 2024-07-02), Structured-Data-Backlog (Current Work), Machine-Learning-Team

May 14 2024

mfossati added a comment to T363506: Pass image objects to the logo detection service.

We concluded that we will figure out the format after the team figures out the spike (accessing the image and sending a thumbnail to Lift Wing).

See T364551: [SPIKE] Send an image thumbnail to the logo detection service within Upload Wizard

I'd suggest we proceed with a base64 encoded image for now.

With binary being the preferred format, right?

May 14 2024, 2:23 PM · Machine-Learning-Team, Structured-Data-Backlog
mfossati renamed T364551: [SPIKE] Send an image thumbnail to the logo detection service within Upload Wizard from [SPIKE] Resize an image file to 224x224 pixels within Upload Wizard to [SPIKE] Send an image thumbnail to the logo detection service within Upload Wizard.
May 14 2024, 2:21 PM · MW-1.43-notes (1.43.0-wmf.12; 2024-07-02), Structured-Data-Backlog (Current Work), Machine-Learning-Team

May 13 2024

mfossati added a comment to T362749: Deploy logo-detection model-server to LiftWing staging.

Yes, Upload stash shouldn't be accessed directly or indirectly. It is internal to mediawiki and private.

Having it private makes total sense from a user privacy point of view. This would also mean that sending image thumbnails from the stash to Lift Wing is out of the question.

I think that the logo detection service can be exposed through an internal endpoint, so it will be inside WMF’s infrastructure.
Moreover, when an image is sent to the upload stash, there’s a set of already implemented checks including existing duplicates and previously deleted duplicates.

May 13 2024, 2:23 PM · Machine-Learning-Team
mfossati added a comment to T362749: Deploy logo-detection model-server to LiftWing staging.

you can just send over the file to liftwing maybe? (we should consider alternative designs and so on).

See T363506: Pass image objects to the logo detection service.

May 13 2024, 1:16 PM · Machine-Learning-Team

May 10 2024

mfossati updated subscribers of T362749: Deploy logo-detection model-server to LiftWing staging.

@mfossati is there any other way to access the images in the upload stash other than using a cookie. Using a user cookie to access an API doesn't seem like the right way for a production application both from a design as well as a security point of view. An API key/token would seem more appropriate (if there is such an option available).

I agree and have dug deeper in the current request being made to the Upload API: maybe the CSRF token is what we're looking for. See upload_file_in_chunks in the example request code. I can confirm that the Upload Wizard is sending a token parameter in the request.

May 10 2024, 10:10 AM · Machine-Learning-Team
mfossati added a comment to T361049: [XL] Improve the file name, caption, and description fields.

(2) I have some problems testing these two AC:

  • Pre-fill the title using file name if it matches the descriptive criteria, if not leave it blank
  • Update the copy for the current error message for when the user has not entered a descriptive title as show here.
May 10 2024, 8:45 AM · MW-1.43-notes (1.43.0-wmf.5; 2024-05-14), Structured-Data-Backlog (Current Work), UploadWizard
mfossati added a comment to T361049: [XL] Improve the file name, caption, and description fields.

(1) the scope of re-designing Describe step presently doesn't include Additional information from the figma mockup

Chiming in: this will be done in T361061: [M] Update the 'other information' field in upload wizard.

May 10 2024, 8:13 AM · MW-1.43-notes (1.43.0-wmf.5; 2024-05-14), Structured-Data-Backlog (Current Work), UploadWizard

May 9 2024

mfossati updated subscribers of T363506: Pass image objects to the logo detection service.

@mfossati I am in favor of passing the image object in some serialized form.
We would need the upload wizard to send a resized image (224x224) instead of the whole file.

I've opened T364551: [SPIKE] Send an image thumbnail to the logo detection service within Upload Wizard to investigate the feasibility of this solution.

May 9 2024, 3:36 PM · Machine-Learning-Team, Structured-Data-Backlog
mfossati added a comment to T363506: Pass image objects to the logo detection service.

@mfossati We noticed that the user can define the width in the url like in this example http://commons.wikimedia.org/w/index.php?title=Special:FilePath&file=Cambia_logo.png&width=224. If we can use this then it would be sufficient and we can stick with using urls in the request.

Hmm, I've just given it a try and I think it won't work for stashed images, which is a hard requirement for us.

@isarantopoulos @kevinbazira , I think I found how to get a thumbnail from a stashed image. There you go: https://commons.wikimedia.org/wiki/Special:UploadStash/thumb/1awuam969hko.2tkfbz.10893556.png/224px-1awuam969hko.2tkfbz.10893556.png, where 1awuam969hko.2tkfbz.10893556.png is the stash file key. The 224px- prefix is the width size.
Of course, I feel there's a caveat, as it seems that the thumbnail is generated on the fly at request time. Still not optimal, but sounds like a workable solution.

May 9 2024, 3:27 PM · Machine-Learning-Team, Structured-Data-Backlog
mfossati created T364551: [SPIKE] Send an image thumbnail to the logo detection service within Upload Wizard.
May 9 2024, 2:36 PM · MW-1.43-notes (1.43.0-wmf.12; 2024-07-02), Structured-Data-Backlog (Current Work), Machine-Learning-Team
mfossati added a comment to T363506: Pass image objects to the logo detection service.

We would need the upload wizard to send a resized image (224x224) instead of the whole file.

I can imagine we can tackle that from within the Upload Wizard with some JavaScript library. I can create a ticket to look into that if you think this would be the best solution.

May 9 2024, 9:14 AM · Machine-Learning-Team, Structured-Data-Backlog
mfossati awarded T363506: Pass image objects to the logo detection service a Mountain of Wealth token.
May 9 2024, 9:09 AM · Machine-Learning-Team, Structured-Data-Backlog
mfossati added a comment to T363506: Pass image objects to the logo detection service.

If one user sends a request with 50 image URLs and another sends a request with 50 serialized images objects, the latter is likely to exceed the server's request body size limit faster.

Thinking out loud: what about sending multiple requests if the limit is reached? I speculate that 50 uploads are an edge case: if this happens, we could dispatch different requests.

May 9 2024, 8:59 AM · Machine-Learning-Team, Structured-Data-Backlog

May 8 2024

mfossati added a comment to T363506: Pass image objects to the logo detection service.

@mfossati We noticed that the user can define the width in the url like in this example http://commons.wikimedia.org/w/index.php?title=Special:FilePath&file=Cambia_logo.png&width=224. If we can use this then it would be sufficient and we can stick with using urls in the request.

Hmm, I've just given it a try and I think it won't work for stashed images, which is a hard requirement for us.

May 8 2024, 4:53 PM · Machine-Learning-Team, Structured-Data-Backlog
mfossati added a comment to T363506: Pass image objects to the logo detection service.

@isarantopoulos , totally agree, makes a lot of sense.

May 8 2024, 1:15 PM · Machine-Learning-Team, Structured-Data-Backlog