felix committed on
Commit
152fc5b
1 Parent(s): 8703711

add 32 from gpt4

Files changed (2)
  1. data_set_training.csv +34 -1
  2. system_promts.txt +60 -0
data_set_training.csv CHANGED
@@ -386,4 +386,37 @@ VALLEY HEALTHCARE SYSTEM 1600 FORT BENNING RD, COLUMBUS, GA 31903|1600 FORT BENN
386
  1234 S KINGSHIGHWAY BLVD, ST. LOUIS, MO 63110|1234 S KINGSHIGHWAY BLVD, ST. LOUIS, MO 63109|0
387
  4021 SW 10TH ST, DEERFIELD BEACH, FL 33442|4021 SOUTHWEST 10TH STREET, DEERFIELD BEACH, FL 33442|1
388
  4021 SW 10TH ST, DEERFIELD BEACH, FL 33442|4022 SW 10TH ST, DEERFIELD BEACH, FL 33443|0
389
- 4021 SW 10TH ST, DEERFIELD BEACH, FL 33442|4021 SW 10TH ST, DEERFIELD BEACH, FL 33444|0
389
+ 4021 SW 10TH ST, DEERFIELD BEACH, FL 33442|4021 SW 10TH ST, DEERFIELD BEACH, FL 33444|0
390
+ 1256 GRAND AVE, SAINT PAUL, MN 55105|1256 GRAND AVENUE, SAINT PAUL, MN 55105|1
391
+ 1256 GRAND AVE, SAINT PAUL, MN 55105|1257 GRAND AVE, SAINT PAUL, MN 55105|0
392
+ 1256 GRAND AVE, SAINT PAUL, MN 55105|1256 GRAND AVE, ST PAUL, MN 55105|1
393
+ 315 N MAIN ST, SPRINGFIELD, IL 62701|315 NORTH MAIN STREET, SPRINGFIELD, IL 62701|1
394
+ 315 N MAIN ST, SPRINGFIELD, IL 62701|316 N MAIN ST, SPRINGFIELD, IL 62701|0
395
+ 4552 W MONROE ST, CHICAGO, IL 60624|4552 WEST MONROE STREET, CHICAGO, IL 60624|1
396
+ 4552 W MONROE ST, CHICAGO, IL 60624|4553 W MONROE ST, CHICAGO, IL 60624|0
397
+ 200 OCEAN AVE, BROOKLYN, NY 11225|200 OCEAN AVENUE, BROOKLYN, NY 11225|1
398
+ 200 OCEAN AVE, BROOKLYN, NY 11225|201 OCEAN AVE, BROOKLYN, NY 11225|0
399
+ 9800 W SUNSET RD, LAS VEGAS, NV 89148|9800 WEST SUNSET ROAD, LAS VEGAS, NV 89148|1
400
+ 9800 W SUNSET RD, LAS VEGAS, NV 89148|9801 W SUNSET RD, LAS VEGAS, NV 89148|0
401
+ 2207 JAMES MADISON HWY, HAYMARKET, VA 20169|2207 JAMES MADISON HIGHWAY, HAYMARKET, VA 20169|1
402
+ 2207 JAMES MADISON HWY, HAYMARKET, VA 20169|2208 JAMES MADISON HWY, HAYMARKET, VA 20169|0
403
+ 7201 S BROADWAY, LITTLETON, CO 80122|7201 SOUTH BROADWAY, LITTLETON, CO 80122|1
404
+ 7201 S BROADWAY, LITTLETON, CO 80122|7202 S BROADWAY, LITTLETON, CO 80122|0
405
+ 123 MAIN ST STE 200, CHICAGO, IL 60601|123 MAIN ST SUITE 200, CHICAGO, IL 60601|1
406
+ 123 MAIN ST STE 200, CHICAGO, IL 60601|123 MAIN ST STE 201, CHICAGO, IL 60601|0
407
+ 123 MAIN ST STE 200, CHICAGO, IL 60601|123 MAIN ST #200, CHICAGO, IL 60601|1
408
+ 456 PARK AVE RM B01, NEW YORK, NY 10022|456 PARK AVE ROOM B01, NEW YORK, NY 10022|1
409
+ 456 PARK AVE RM B01, NEW YORK, NY 10022|456 PARK AVE RM B02, NEW YORK, NY 10022|0
410
+ 456 PARK AVE RM B01, NEW YORK, NY 10022|456 PARK AVE #B01, NEW YORK, NY 10022|1
411
+ 789 BROADWAY APT 3A, BROOKLYN, NY 11211|789 BROADWAY APARTMENT 3A, BROOKLYN, NY 11211|1
412
+ 789 BROADWAY APT 3A, BROOKLYN, NY 11211|789 BROADWAY APT 3B, BROOKLYN, NY 11211|0
413
+ 789 BROADWAY APT 3A, BROOKLYN, NY 11211|789 BROADWAY #3A, BROOKLYN, NY 11211|1
414
+ 1001 WILSHIRE BLVD UNIT 101, LOS ANGELES, CA 90024|1001 WILSHIRE BLVD UNIT 101, LOS ANGELES, CA 90024|1
415
+ 1001 WILSHIRE BLVD UNIT 101, LOS ANGELES, CA 90024|1001 WILSHIRE BLVD UNIT 102, LOS ANGELES, CA 90024|0
416
+ 1001 WILSHIRE BLVD UNIT 101, LOS ANGELES, CA 90024|1001 WILSHIRE BLVD #101, LOS ANGELES, CA 90024|1
417
+ 555 MARKET ST FL 5, SAN FRANCISCO, CA 94105|555 MARKET ST FLOOR 5, SAN FRANCISCO, CA 94105|1
418
+ 555 MARKET ST FL 5, SAN FRANCISCO, CA 94105|555 MARKET ST FL 6, SAN FRANCISCO, CA 94105|0
419
+ 555 MARKET ST FL 5, SAN FRANCISCO, CA 94105|555 MARKET ST #5, SAN FRANCISCO, CA 94105|1
420
+ 888 PEACHTREE ST NE APT 5C, ATLANTA, GA 30309|888 PEACHTREE ST NE APARTMENT 5C, ATLANTA, GA 30309|1
421
+ 888 PEACHTREE ST NE APT 5C, ATLANTA, GA 30309|888 PEACHTREE ST NE APT 5D, ATLANTA, GA 30309|0
422
+ 888 PEACHTREE ST NE APT 5C, ATLANTA, GA 30309|888 PEACHTREE ST NE #5C, ATLANTA, GA 30309|1
system_promts.txt CHANGED
@@ -7,3 +7,63 @@ You are tasked with helping to generate test data for machine learning dataset.
7
 
8
  Generate completely different addresses.
9
  You are tasked with helping to generate test data for machine learning dataset. Do no prefix numbers before each line of output. User is expected to prompt with a sample US address but without the City, State, Zipcode part. As a model your task is to generate 10 random addresses that are inspired by the structure of the address user provided. Address formats should be as if a regular person may enter it into some system.
10
+
11
+
12
+ Your task is to generate training samples based on an evaluation set provided for a neural network training problem.
13
+ The format of the evaluation set as follows.
14
+ 1. A let of rows separated by newlines.
15
+ 2. Each row has the following structure:
16
+ US address 1|US address 2|1
17
+ or
18
+ US address 1|US address 2|0
19
+ The first and second address are compared to see if they are the same address as may have been typed
20
+ by a user into some system. Users cannot alter the format or position of two letter state code or the five digit zipcode.
21
+ All the letters are always capitalized so user inputs are always uppercased for purposes of training data.
22
+ Given the following evaluation set generate 50 rows of training data that would produce a network with high accuracy but
23
+ not so similar that they would cause overfitting to the evaluation set.
24
+ Evaluation set:
25
+ 1061 SCHMIDT LN, NORTH BRUNSWICK TOWNSHIP, NJ 08902|1061 SCHMIDT LANE, NORTH BRUNSWICK TOWNSHIP, NJ 08902|1
26
+ 1061 SCHMIDT LN, NORTH BRUNSWICK TOWNSHIP, NJ 08902|934 SCHMIDT LN, NORTH BRUNSWICK TOWNSHIP, NJ 08902|0
27
+ 115 DR JIMMY CARR ST, STE 3F, SEARCY, AR 72143|115 DR JIMMY CARR ST, SEARCY, AR 72143|0
28
+ 14143 WINECUP LN, HOUSTON, TX 77047|14121 WINECUP LANE, HOUSTON, TX 77047|0
29
+ 1555 RUTH RD STE 5, NORTH BRUNSWICK TOWNSHIP, NJ 08902|1555 RUTH ROAD SUIT 5, NORTH BRUNSWICK TOWNSHIP, NJ 08902|1
30
+ 1555 RUTH RD STE 5, NORTH BRUNSWICK TOWNSHIP, NJ 08902|1558 RUTH ROAD STE 5, NORTH BRUNSWICK TOWNSHIP, NJ 08902|0
31
+ 17752 MAIN ST, HANOVER, PA 23393|177 52 MAIN ST, HANOVER, PA 23393|1
32
+ 17752 MAIN ST, HANOVER, PA 23393|177 52 MAIN STREET, HANOVER, PA 23393|1
33
+ 217-12 LOUDON RD, CONCORD, NH 03301|217-22 LOUDON RD, CONCORD, NH 03301|0
34
+ 2575 US HWY 43, ST 3-A, WINFIELD, AL 35594|25-75 US HWY 43, STREET 3A, WINFIELD, AL 35594|1
35
+ 2575 US HWY 43, ST 3-A, WINFIELD, AL 35594|2575 US HWY 43, ST 3B, WINFIELD, AL 35594|0
36
+ 440 TECHNOLOGY CENTER DRIVE, BOSTON, MA 10034|200 TECHNOLOGY CENTER DRIVE, BOSTON, MA 10034|0
37
+ 440 TECHNOLOGY CENTER DRIVE, BOSTON, MA 10034|440 TECHNOLOGY CENTER DR., BOSTON, MA 10034|1
38
+ 440 TECHNOLOGY CENTER DRIVE, BOSTON, MA 10034|87 TECHNOLOGY CENTER DRIVE, BOSTON, MA 10034|0
39
+ 545 16TH ST, STE 3, GULFPORT, MS 39507|545 16TH ST, FLOOR 3, GULFPORT, MS 39507|0
40
+ 5844 N ORANGE BLOSSOM TRAIL, ORLANDO, FL 32810|5844 NORTH ORANGE BLOSSOM TRAIL, ORLANDO, FL 32810|1
41
+ 65 MOUNTAIN BLVD EXT, WARREN, NJ 07059|112 MOUNTAIN BLVD EXT, WARREN, NJ 07059|0
42
+ 65 MOUNTAIN BLVD EXT, WARREN, NJ 07059|5078 S MARYLAND PKWY, LAS VEGAS, NV 89119|0
43
+ 65 MOUNTAIN BLVD EXT, WARREN, NJ 07059|65 MOUNTAIN BOULEVARD EXT, WARREN, NJ 07059|1
44
+ 6701 FANNIN ST #1400, HOUSTON, TX 77030|6701 FANNIN STE #1400, HOUSTON, TX 77030|1
45
+ 87 24 ROUTE 13, CORTLANDVILLE, NY 13045|87-24 ROUTE 13, CORTLANDVILLE, NY 13045|1
46
+ 87-43 ROUTE 13, CORTLANDVILLE, NY 13045|8724 ROUTE 13, CORTLANDVILLE, NY 13045|0
47
+ 87-44 ROUTE 13, CORTLANDVILLE, NY 13045|87 24 ROUTE 13, CORTLANDVILLE, NY 13045|0
48
+ 872 ROUTE 13, CORTLANDVILLE, NY 13045|87-2 ROUTE 13, CORTLANDVILLE ,NY 13045|1
49
+ 8724 ROUTE 13, CORTLANDVILLE, NY 13045|87-24 ROUTE 13, CORTLANDVILLE, NY 13045|1
50
+ HEART HEALTH, 90 N COLUMBUS AVE, LOUISVILLE, MS 39339|90 N COLUMBUS AVE, LOUISVILLE, MS 39339|1
51
+ 115 34 SHOREWAY DR, QUEENSTOWN, MD 21658|115-43 SHOREWAY DR, QUEENSTOWN, MD 21658|0
52
+ 112 24 SHOREWAY DR, QUEENSTOWN, MD 21658|112-24 SHOREWAY DR, QUEENSTOWN, MD 21658|1
53
+ 3619 S 22ND DR, YUMA, AZ 85364|3636 S 22ND DR, YUMA, AZ 85364|0
54
+ 7325 FRANKLIN BLVD, SACRAMENTO, CA 95823|73235 FRANKLIN BLVD, SACRAMENTO, CA 95823|0
55
+ 3660 MAIN ST, TUCSON, AZ 85721|3701 MAIN ST, TUCSON, AZ 85721|0
56
+ 3910 MAGNET RD, MALVERN, AR 72104|3910 MAGNET RD, STE 206 MALVERN, AR 72104|0
57
+ 15702 OBERLIN RD, RALEIGH, NC 27605|15702 OBERLIN RD FL 1, RALEIGH, NC 27605|1
58
+ 14425 ROOSOVELT AVE APT 322, LA JOLLA, CA 92092|14325 ROOSOVELT AVE, LA JOLLA, CA 92092|0
59
+ 14425 ROOSOVELT AVE APT 322, LA JOLLA, CA 92092|144-25 ROOSOVELT AVE APT 322, LA JOLLA, CA 92092|1
60
+ 14425 ROOSOVELT AVE, LA JOLLA, CA 92092|144-25A ROOSOVELT AVENUE, LA JOLLA, CA 92092|0
61
+
62
+ Training samples:
63
+
64
+ good but now instead of varying one part of the address like the building number vary two or more
65
+ parts of the address where some parts will have differences like the building number may have one
66
+ different character and other variations may be because STREET is spelled STR or include other common
67
+ variations that don't actually change the meaning of the address. Remember to generate both
68
+ positive and negative pairs that will fit the evaluation set. Generate 50 samples
69
+
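
The row format described in the prompt above (two uppercase US addresses and a `1`/`0` label, separated by `|`) could be checked with something like the sketch below before generated rows are appended to data_set_training.csv. This is only an illustration, not code from this commit: the names `AddressPair` and `parse_pairs` are hypothetical, and it assumes that no address contains a `|` character.

```python
from typing import List, NamedTuple


class AddressPair(NamedTuple):
    """One labeled row (hypothetical helper type, not part of the repo)."""
    left: str
    right: str
    is_match: bool


def parse_pairs(text: str) -> List[AddressPair]:
    """Parse 'ADDRESS 1|ADDRESS 2|LABEL' rows, skipping blank lines.

    Assumes the label is the literal 0 or 1 and that '|' never
    appears inside an address.
    """
    pairs = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        line = line.strip()
        if not line:
            continue  # tolerate empty lines between rows
        fields = [field.strip() for field in line.split("|")]
        if len(fields) != 3 or fields[2] not in ("0", "1"):
            raise ValueError(f"line {line_no}: malformed row {line!r}")
        left, right, label = fields
        pairs.append(AddressPair(left, right, label == "1"))
    return pairs


if __name__ == "__main__":
    # Two rows taken from the data added in this commit.
    sample = (
        "1256 GRAND AVE, SAINT PAUL, MN 55105|1256 GRAND AVENUE, SAINT PAUL, MN 55105|1\n"
        "1256 GRAND AVE, SAINT PAUL, MN 55105|1257 GRAND AVE, SAINT PAUL, MN 55105|0\n"
    )
    rows = parse_pairs(sample)
    print(sum(r.is_match for r in rows), "of", len(rows), "pairs are labeled as matches")
```

Running a check like this on model output would catch rows with a missing label or an extra separator before they reach the training file.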